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(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv . zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp Val Gin Ala Gly 
15 10 15 

Ala Thr Leu Gly Glu Val Tyr Tyr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv, zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr Pro Gly Xaa Ser 
15 10 15 

Ser Phe Pro Thr Val Leu Gin Asn Tyr 
20 25 



(2) INFORMATION FOR SEQ ID NO : 3: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AACTTCTCCN AGNGTNGCNC CNGC TTGNAC CCA 3 3 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 
GATCCNTCTT T C C CN ATT AC TGGNGAGGTT TA 3 2 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC TCA 4 8 

Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser 
15 10 15 

TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96 
Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn Leu Arg Phe Asn 
20 25 30 

GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144 
Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr Ala Glu His Val 
35 40 45 

TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192 
Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin Asn Arg Leu Leu 
50 55 60 

CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 24 0 
Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu 
65 70 75 80 

ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG 2 88 
Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met Phe Asn Leu Arg 
85 90 95 

TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC 
336 

Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp Val Gin Ala Gly 
100 105 110 

GCC ACC CTC GGA GAA GTT 3 54 

Ala Thr Leu Gly Glu Val 
115 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 



Asp Pro Ser Phe 
1 

Ser Phe Pro Thr 
20 

Glu Thr Thr Thr 
35 

Ser His lie Gin 
50 

Leu Lys Thr Arg 
65 

Thr Asn Thr Asn 



Ser lie Asn Val 
100 

Ala Thr Leu Gly 
115 



Pro lie Thr Gly 
5 

Val Leu Gin Asn 



Pro Lys Pro Phe 
40 

Ala Ala Val Val 
55 

Ser Gly Gly His 
70 

Gin Pro Phe Phe 
85 

Asp lie Glu Gin 



Glu Val 



Glu Val Tyr Thr 
10 

Tyr lie Arg Asn 
25 

Leu lie lie Thr 



Cys Gly Lys Gin 
60 

Asp Tyr Glu Gly 
75 

lie Val Asp Met 
90 

Glu Thr Ala Trp 
105 



Pro Gly Asn Ser 
15 

Leu Arg Phe Asn 
30 

Ala Glu His Val 
45 

Asn Arg Leu Leu 



Leu Ser Tyr Leu 
80 

Phe Asn Leu Arg 
95 

Val Gin Ala Gly 
110 



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) ^ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : mi sc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
CAGGCAGCTG TGGTTTGTGG C 



(2) INFORMATION FOR SEQ ID NO : 8: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME / KEY : misc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
GTCCACAATG AAGAAGGGTT G 21 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 
ACGTAGATAT CGAACAAGAA ACCGC 2 5 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCTTTACTAC ACGGGCTTCC CCAG 24 



(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTGGGGAAGC CCGTGTAGTA AAGC 2 4 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i') SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTACTCCAA CCACGGCGCT C 21 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



CDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGGAAGTTG CAGAAGATTG GGTTG 2 5 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAGCAAGAGA AGAAGGAGAC 2 0 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1784 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 21.. 1608 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 5 0 



Met Gin Thr Ser lie Leu Thr Leu Leu Leu 
15 10 



CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98 
Leu Leu Leu Ser Thr Gin Ser Ser Ala Thr Ser Arg Ser lie Thr Asp 
15 20 25 

CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 14 6 
Arg Phe lie Gin Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro lie 
30 35 40 

ACC GGA GAG GTT TAC ACT CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG 194 
Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser Ser Phe Pro Thr Val Leu 
45 50 55 

CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA 2 42 
Gin Asn Tyr lie Arg Asn Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys 
60 65 70 

CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT 2 90 
Pro Phe Leu lie lie Thr Ala Glu His Val Ser His lie Gin Ala Ala 
75 80 85 90 

GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT 33 8 

Val Val Cys Gly Lys Gin Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly 

95 100 105 

GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC 3 86 

Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro 

110 115 120 

TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC 4 34 
Phe Phe lie Val Asp Met Phe Asn Leu Arg Ser lie Asn Val Asp lie 
125 130 135 

GAA CAA GAA ACC GCA TGG GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG 4 82 

Glu Gin Glu Thr Ala Trp Val Gin Ala Gly Ala Thr Leu Gly Glu Val 
140 145 150 

TAC TAT CGA ATA GCG GAG AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG 53 0 

Tyr Tyr Arg lie Ala Glu Lys Ser Asn Lys His Gly Phe Pro Ala Gly 
155 160 165 170 

GTT TGT CCA ACG GTT GGC GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT 5 78 

Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr 
175 180 185 

GGT AAT TTG ATG AGA AAA TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT 62 6 

Gly Asn Leu Met Arg Lys Tyr Gly Leu Ser Val Asp Asn lie Val Asp 
190 195 200 

GCT CAA ATA ATA GAT GTG AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG 674 
Ala Gin lie lie Asp Val Asn Gly Lys Leu Leu Asp Arg Lys Ser Met 
205 210 215 



GGT GAG GAT TTG TTT TGG GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT 72 2 

Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly Gly Val Ser Phe 
220 225 230 

GGT GTG GTT CTA GCC TAC AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT 77 0 

Gly Val Val Leu Ala Tyr Lys lie Lys Leu Val Arg Val Pro Glu Val 
235 240 245 250 

GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 818 
Val Thr Val Phe Thr lie Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr 
255 260 265 

ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 8 66 

lie Ala Glu Arg Trp Val Gin Val Ala Asp Lys Leu Asp Arg Asp Leu 
270 275 280 

TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914 
Phe Leu Arg Met Thr Phe Ser Val lie Asn Asp Thr Asn Gly Gly Lys 
285 290 295 

ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962 
Thr Val Arg Ala lie Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn 
300 305 310 

CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010 
Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu 
315 320 325 330 

TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 10 5 8 
Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr 
335 340 345 

GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106 
Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro 
350 355 360 

CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154 
Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr Val Gin Asn Pro 
365 370 375 

ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 12 02 
lie Ser Lys Arg Gin Phe Glu Phe lie Phe Glu Arg Leu Lys Glu Leu 
380 385 390 

GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA 12 5 0 
Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu 
395 400 405 410 

ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 12 98 
lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn lie Ala 
415 420 425 

AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 134 6 
Lys lie Gin Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu 



430 



435 



440 



AAT CGT TAC TTG AAT TTC ACA AGG CTG ATG TAT GAT TAC ATG ACC CCA 13 94 
Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro 
445 450 455 

TTT GTG TCG AAA AAC CCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 144 2 
Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu 
460 465 470 

GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 14 90 
Asp lie Gly lie Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met 
475 480 485 490 

GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA 15 3 8 
Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val 
495 500 505 

AGT GTG AAG ACT AAA GTT GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA 1586 
Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin 
510 515 520 

AGC ATC CCA ACT TTG TCA TCT T G AAG AAC GTA CATATATAAA TAAATACCTT 163 8 

Ser lie Pro Thr Leu Ser Ser 
525 

TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA GAATTTTGAC 1698 
TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA ATTCTGAAAA 175 8 
ATACATATTA GTATTGT CAA AAAAAA 17 84 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gin Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 

Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 
20 25 30 

His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 



50 



55 



60 



Leu Arg Phe Asn 
65 

Ala Glu His Val 



Asn Arg Leu Leu 
100 

Leu Ser Tyr Leu 
115 

Phe Asn Leu Arg 
130 

Val Gin Ala Gly 
145 

Lys Ser Asn Lys 



Val Gly Gly His 
180 

Tyr Gly Leu Ser 
195 

Asn Gly Lys Leu 
210 

Ala lie Thr Gly 
225 

Lys lie Lys Leu 



Glu Arg Arg Glu 
260 

Gin Val Ala Asp 
275 

Ser Val lie Asn 
290 

Pro Thr Leu Tyr 
305 

Lys Asp Phe Pro 



Ser Trp Val Glu 



Glu Thr Thr Thr 
70 

Ser His lie Gin 
85 

Leu Lys Thr Arg 



Thr Asn Thr Asn 
120 

Ser lie Asn Val 
135 

Ala Thr Leu Gly 
150 

His Gly Phe Pro 
165 

Phe Ser Gly Gly 



Val Asp Asn lie 
200 

Leu Asp Arg Lys 
215 

Gly Gly Gly Val 
230 

Val Arg Val Pro 
245 

Glu Gin Asn Leu 



Lys Leu Asp Arg 
280 

Asp Thr Asn Gly 
295 

Leu Gly Asn Ser 
310 

Glu Leu Gly Leu 
325 

Ser Val Leu Tyr 



Pro Lys Pro Phe 
75 

Ala Ala Val Val 
90 

Ser Gly Gly His 
105 

Gin Pro Phe Phe 



Asp lie Glu Gin 
140 

Glu Val Tyr Tyr 
155 

Ala Gly Val Cys 
170 

Gly Tyr Gly Asn 
185 

Val Asp Ala Gin 



Ser Met Gly Glu 
220 

Ser Phe Gly Val 
235 

Glu Val Val Thr 
250 

Ser Thr lie Ala 
265 

Asp Leu Phe Leu 



Gly Lys Thr Val 
300 

Arg Asn Leu Val 
315 

Gin Glu Ser Asp 
330 

Tyr Thr Gly Phe 



Leu lie lie Thr 
80 

Cys Gly Lys Gin 
95 

Asp Tyr Glu Gly 
110 

lie Val Asp Met 
125 

Glu Thr Ala Trp 



Arg lie Ala Glu 
160 

Pro Thr Val Gly 
175 

Leu Met Arg Lys 
190 

lie lie Asp Val 
205 

Asp Leu Phe Trp 



Val Leu Ala Tyr 
240 

Val Phe Thr lie 
255 

Glu Arg Trp Val 
270 

Arg Met Thr Phe 
285 

Arg Ala lie Phe 



Thr Leu Leu Asn 
320 

Cys Thr Glu Met 
335 

Pro Ser Gly Thr 



340 



345 



350 



Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe 
355 360 365 

Lys lie Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
370 375 380 

Glu Phe lie Phe Glu Arg Leu Lys Glu Leu Glu Asn Gin Met Leu Ala 
385 390 395 400 

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu lie Ser Glu Phe Ala Lys 
405 410 415 

Pro Phe Pro His Arg Ser Gly Asn lie Ala Lys lie Gin Tyr Glu Val 
420 425 430 

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe 
435 440 445 

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro 
450 455 460 

Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu Asp lie Gly lie Asn Ser 
465 470 475 480 

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr 
485 490 495 

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val 
500 505 510 

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin Ser lie Pro Thr Leu Ser 
515 520 525 



Ser 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CCGCCATGGA GACTTCCATT CTTACTC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGGATCCT CAAGATGACA AAGTTGGGAT GCT 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1590 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 
Met Glu Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 



TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 
Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 



20 



25 



30 



CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 14 4 
His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 

CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192 
Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 
50 55 60 

CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 24 0 
Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr 
65 70 75 80 

GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 2 88 
Ala Glu His Val Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin 
85 90 95 

AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 33 6 
Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly 
100 105 110 

CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 3 84 
Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met 
115 120 125 

TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 4 32 
Phe Asn Leu Arg Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp 
130 135 140 

GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 4 80 
Val Gin Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg lie Ala Glu 
145 150 155 160 

AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 52 8 
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly 
165 170 175 

GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 5 76 
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys 
180 185 190 

TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 62 4 
Tyr Gly Leu Ser Val Asp Asn lie Val Asp Ala Gin lie lie Asp Val 
195 200 205 

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 672 
Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp 
210 215 220 

GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAC 72 0 
Ala lie Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr 
225 230 235 240 



/ 



AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT GTG ACC GTG TTT ACC ATT 76 8 
Lys lie Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr lie 
245 250 255 

GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC ATC GCG GAA CGA TGG GTA 816 
Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr lie Ala Glu Arg Trp Val 
260 265 270 

CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT TTC CTT CGA ATG ACC TTT 864 
Gin Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe 
275 280 285 

AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG ACA GTC CGT GCT ATC TTT 912 
Ser Val lie Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala lie Phe 
290 295 300 

CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT CTT GTT ACA CTT TTG AAT 960 
Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn 
305 310 315 320 

AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA TCG GAT TGT ACT GAA ATG 100 8 
Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu Ser Asp Cys Thr Glu Met 
325 330 335 

AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG GGC TTC CCC AGT GGT ACT 105 6 
Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr 
340 345 350 

CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT CAA AGA CTC AAC CCA TTC 1104 
Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe 
355 360 365 

AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152 
Lys lie Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
370 375 380 

GAG TTC ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 12 0 0 
Glu Phe lie Phe Glu Arg Met Lys Glu Leu Glu Asn Gin Met Leu Ala 
385 390 395 400 

TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 124 8 
Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu lie Ser Glu Phe Ala Lys 
405 410 415 

CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 12 96 
Pro Phe Pro His Arg Ser Gly Asn lie Ala Lys lie Gin Tyr Glu Val 
420 425 430 

AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 13 44 
Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe 
435 440 445 

ACA AGG CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 13 92 



Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro 
450 455 460 



AGA GAA GCA TTT TTG AAC TAT AGG GAT TTG GAT ATT GGT ATC AAC AGC 144 0 
Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp lie Gly lie Asn Ser 
465 470 475 480 

CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG GTT TAT GGG CAC AAA TAT 14 88 
His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr 
485 490 495 

TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA AGT GTG AAG ACT AAA GTT 15 3 6 
Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val 
500 505 510 

GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA AGC ATC CCA ACT TTG TCA 15 84 
Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin Ser lie Pro Thr Leu Ser 
515 520 525 

TCT TG 15 8 9 

Ser 

530 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 
, (A) LENGTH: 52 9 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 20: 

Met Glu Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 

Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 
20 25 30 

His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 
50 55 60 

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr 
65 70 75 80 



Ala Glu His Val Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin 
85 90 95 



Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly 
100 105 110 

Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met 
115 120 125 

Phe Asn Leu Arg Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp 
130 135 140 

Val Gin Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg lie Ala Glu 
145 150 155 160 

Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly 
165 170 175 

Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys 
180 185 190 

Tyr Gly Leu Ser Val Asp Asn lie Val Asp Ala Gin lie lie Asp Val 
195 200 205 

Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp 
210 215 220 

Ala lie Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr 
225 230 235 240 

Lys lie Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr lie 
245 250 255 

Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr lie Ala Glu Arg Trp Val 
260 265 270 

Gin Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe 
275 280 285 

Ser Val lie Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala lie Phe 
290 295 300 

Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn 
305 310 315 320 

Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu Ser Asp Cys Thr Glu Met 
325 330 335 

Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr 
340 345 350 

Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe 
355 ' 360 365 

Lys lie Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
370 375 380 



Glu Phe lie Phe Glu Arg Met Lys Glu Leu Glu Asn Gin Met Leu Ala 
385 390 395 400 



Phe Asn Pro Tyr Gly 
405 

Pro Phe Pro His Arg 
420 

Asn Trp Glu Asp Leu 
435 



Thr Arg Leu Met Tyr 
450 

Arg Glu Ala Phe Leu 
465 

His Gly Arg Asn Ala 
485 

Phe Lys Glu Thr Asn 
500 

Asp Pro Asp Asn Phe 
515 



Gly Arg Met Ser Glu lie 
410 

Ser Gly Asn lie Ala Lys 
425 

Ser Asp Glu Ala Glu Asn 
440 



Asp Tyr Met Thr Pro Phe 
455 

Asn Tyr Arg Asp Leu Asp 

470 475 

Tyr Thr Glu Gly Met Val 
490 

Tyr Lys Arg Leu Val Ser 
505 

Phe Arg Asn Glu Gin Ser 
520 



Ser Glu Phe Ala Lys 
415 

lie Gin Tyr Glu Val 
430 

Arg Tyr Leu Asn Phe 
445 



Val Ser Lys Asn Pro 
460 

lie Gly lie Asn Ser 
480 

Tyr Gly His Lys Tyr 
495 

Val Lys Thr Lys Val 
510 

lie Pro Thr Leu Ser 
525 



Ser 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .350 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GAGAAACTCG GAGACTTTCA CACAATGCCT AACCTCAAAC TCCGACCCCA AACATCCCAT 60 



CTCCCCCGCT ATCTTCTTCT CCGGAAATGG CTCCTACTCC TCCGTATTAC AAGCCAACAT 12 0 
CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA TCGCCGCAAC 18 0 
AC ATGAATC C CATGTGCAAG CCGCGATTAC TTGCGGGAAA CGCCACAACC TTCAGATGAA 24 0 
AATCAGAAGT GGAGGCCACG ACTACGATGG CTTGTCATAC GTTACATACT CTGGCAAACC 3 00 
GTTCTTCGTC CTCGACATGT TTAACCTCCG TTCGGTGGAT GTCGACGTGG 3 50 



(2) INFORMATION FOR SEQ ID NO : 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix)' FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2.. 278 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT GGGAGAAAAG 60 

CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG GGCATTTAAG 12 0 

CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT ACGTTGAGGA 18 0 

TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG . AAAGCAATGG GTGAGGATCT 24 0 

GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 2 78 



(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



( D ) TOPOLOGY : linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . . 345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT TAGCGTCTGA 6 0 

TAACGTTGTG GACG CACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC GGAAAACGAT 12 0 

GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG GCGTTGTCTT 180 

GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA TAAGTCAACA 24 0 

TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG GATCAAGANN 3 00 

GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 34 5 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. . 695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA NGNAACC CAA 60 

NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC TGCGNANCCN 12 0 

ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT GNCTAGANNG 180 

AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA TTACATGGCG 24 0 

CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT AGACTTCGGA 3 00 

CAGAACAAGA AC AACG C AAA GGTTAACTTC ATTGAAGCTA AAATCTGGGG ACCTAAGTAC 3 60 

TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC AGAGAACTTC 42 0 

TTCAGGCACG AG C AGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC ATGAAACCAA 4 80 

TAACATTATC AAAAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG KCTTTCTTTA 54 0 

TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA AAAAAAAATA 600 

ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT NGGACGGGGN 660 

GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 695 

(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

<ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2 . .4 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT TTCCTGAATT 6 0 
NAGTTTAAAG AAGGAGAACT GTTCGGAGAT GACTTGGTTT CAGTCAGCTT TATGGTGGGA 12 0 



TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA ATCTTGATAG 18 0 



AGCGAATTTC GGAAAGAGGA AATCGGATTA CGTTGCGAGT AAGATTCCTA GAGATGGGAT 24 0 



TAAGYCTTTT TCCAAGARGA TGMCTGACCT 
TGGTGGGAAA ATGGCGGAGG TTACGGTTAA 
TTTTAAGATT CAGTACTCGG TGACTTNGCA 
TCTTGAATCA GGCTAACGTC CTTATAGGTT 
AATNCTTACT TNAAT 



GGGGAAAAYC GGGCTTGTTT TTAAWC CGT A 3 00 
CGCGACGCCG TTTCCNCACC GAAGCAAGCT 3 60 
AGAAAACTCT NTCGAGATAG AGAAAGGGTT 420 
CATGACCGGG TTTTTNAGCA AGANCCCTGG 480 

495 



(2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . .2 04 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG CTTGGTTTCA ACCTGGTGCT 6 0 

ACGCTTGGTG AGCTTTACTA CAGAATTNCA GAGAAGAGCA AAATCCATGG ATTTCCNGCG 12 0 

GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA CGGTACCTTG 18 0 

ATGAGGAAGT ATGGTCTTNC GGGA 2 04 



(2) INFORMATION FOR SEQ ID NO : 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE : cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 4 91 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 27: 
GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG GGAAAATCAG 60 
CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT ATAATATACA 12 0 



GTACATGGTG AAATGGAAAG TGAATGANGT 

GAGATCGTTA CACGATTACA TGACTCCGTA 

GANTTACAGA GATCTTGATT TGGGCTCGAC 

AAGGAAATGG NNGGGTGAGN CTTTTTTCAA 

AAAGGGGAGG TTTNNCCCAN CAAATTTTTT 

TNTTTTTNGG NCCCCAATCN AAANCCCCGT 

NNTTAAAAGG G 



CGAGGAGATG AACAAACATG TCAGGTGGAT 18 0 

TGTTTCTAAA TCGCCGAGAG GAGCTTATTT 24 0 

CAAAGGGATT AACACGGGTT TCGGAGATGC 3 00 

AGGTAATTTC CAAGGGGTTA GGTTTTGGTT 3 60 

TTCAGGANCC GGCCANGNTT TTCCCCCCCC 42 0 

TTTAAAAGGG GGGCCATTTC NTTTTTTNCA 4 80 

491 



(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNE S S : doub 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 



(A) NAME/ KEY: CDS 

(B) LOCATION: 3 . .4 07 



(xi) SEQUENCE DESCRIPTION: S] 

ATTTGTTCGT GAGGTTAACT TTGACTTTAG 

TTTTAGCGAC TTTCATTGGG ATGTATTTAG 

ACCGGGATTT CCCGGAGTTG AAGCTGAAGA 

ATTCGGTTCT GTTTTGGGAC GATTATCCGG 

CGCTAGTCGC AAAAAAGTTG TTCATGAAAC 

TCGAGAACCC GATCTCNNGT TTGATACTCA 

TNAATTTGGA TCCGCATTNN GGNANNNATG 



:Q ID NO : 2 8: 

TCAACGGTAC GAAGCCTGGT GAGAATACGG 6 0 

GCCGGTCGGA TAAGCTGTTG AC CGTNATGA 12 0 

AAACCGATTN TACCGAGATG AGATGGATCG 18 0 

TTGGTACACC GACTTCTGTG CTACTAAATC 24 0 

GAAAATCGGA CTACGTGAAG CGTCTNATTT 3 00 

AGAAATTTGT AGAGGTTNNG AAAGTTAAAA 3 60 

GTGAAACCCC NNGTTNT 4 07 



(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . .3 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC GTCACCGTTT 60 

TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG TGGCAACAAG 12 0 

TTGCACCGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA NAAACGGCAC 180 

TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAGC AATGTTTCTA GGAGACACGN 24 0 

CAACTCTACT GTCGATATTA AACCGGAGAT TCCCAGAATT GGGTTTGGTC CGGTCTGACT 3 00 



GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA TCCCAGGTTG 360 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT TCTAAGTGGC 60 

AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC AGAGCAGCTG 12 0 

GAAATGATGG AAACAAGACT GTGACAATGA CCTACCTAGG TCAGTTTCTT GGCGAGAAAG 180 

GCACCTTGCT GAAGGTTATG GAGAAGG CTT TTCCAGAACT AGGGTTAACT CAAAAGGATT 24 0 

GTACTGAAAT GAGCTGGATT GAAGCCGCCC TTTTCCATGG TGGRTTTCCA ACAGGKTCTC 3 00 

CTATTGAAAT TTTGCTTMAG CTCAAGTCGC CTYTAGGAAA AGRTTWCTTC AAAGCAACGK 3 60 

CGGATTTCGT TAAAGAACCT WTTCCTGTGA TAGGGCTCAA AGGAATATTC AAAAGATTGA 42 0 

TTGAAGG 42 7 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI -SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 

GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC CGNGTTTAAA 60 

GTC CAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA GTATGTTGCG 12 0 

GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA CGGAAACTCT 180 

GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNC CGGTGA ATAATNTCTT GGCCTTGATG 24 0 

GAAGAAAAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT GAGTTGGGCT 3 00 

GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT GTTCCNCGCG 3 60 

GATCGGNAGC GGNCAATTTN TGGNC TTTCA AGGGGAAAGN CGGCTTTTTN CAAGAACCCG 42 0 

NTACCCGGGG TTCAATT 437 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..441 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA CAAGCCGGAG 6 0 



ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC 

AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT 

AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT 

TT CGAGAG AT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT 

C AG AG CAT T C CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT 

CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT 
TGTGTGTGAT TGTGAATTGT G 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN : ^ ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . . 502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGT AC AC C C A AGAGCATCCT 6 0 

TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT TTGTCAAAAA 12 0 

ACCAATACCC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT TCAACAGTAG 18 0 

CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG CAACGGCCAC 24 0 

CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA TGTGGTTTGA 3 00 

CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG TGGCGGG AC C 3 60 

GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTTCAGAGNC CATCGNTNTT 42 0 



ATATGTTTCG 12 0 
GTATCTTGGA 18 0 
CAAGAACAAT 24 0 
CTGCGATGAA 3 00 
TTTTATTAGT 3 60 
GTGTAACTTA 42 0 
441 



NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC TTNGGGGGCC 4 80 



CAGAATTTCC TTNGGGGAAT TT 



502 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2 . . 400 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT TCCCAAGAAN 6 0 

CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT CTCATAATGG 12 0 

TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG CAGGTAATTT 18 0 

CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA GGAACGAACA 24 0 

GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA AATTTGTTGT 3 00 

TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AATTTATAAA GTTTGATATT 3 60 

AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 4 00 



(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2 . .3 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACTCTAAT GGGAACACCT ACTTCGATCG AATGTCGATG GGGGAAGAGC TTTTCTGGGC 60 
GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA TCCGGTTGGT 12 0 
TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG AAGGAGCCGT 180 
TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT TNTTTGTGAA 24 0 
GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT TTAGNGNCTT 3 00 
TCANTTTGGN TGNAANCTTG GGGGTTTTAT NAGAACGGTT AACCGGGATT NANCCCGNGT 3 60 
TTTCCCGGGG TTAAAACCTT NGG 3 83 



(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . . 354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 36: 
ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT TCAAGGCTAA ATCTGATTTT 6 0 
NTTCAAGAAC CGATNCCTAA AACCGCGATT TCGAAGCTTT GGAGACGGTT GCAAGAACCG 12 0 



GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG GTAAAATGAG TNAGATTGCA 180 



GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT ATNAGATTCA GTACTTGAAT 24 0 
TACTGGAGAG GAGACGTGAA AGAGAAGTAT ATTGAGATNG GTGGAGGAGA GTTTACGGTT 3 00 
GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 3 54 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidposis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37: 

TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GCCACAAAAG TATCTGCAAT 6 0 

CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA AAGTGAAATC 12 0 

AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA TGAACCAGAA 180 

GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA TCGAATCGAT 24 0 

AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA GAGAC CTAAG 3 00 

GATTCGAGGA TCAGTATTTC AAAGCAAAGT CAGGATTATG GTTCAAAAAC CCGTTCCTGA 3 60 

AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 4 03 



(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . .2 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 38: 

GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG AGAATCTCTT 6 0 

NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATC TTTNA AAGGCAAAGA TGATTTTATC 12 0 

CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA AGCCCCCNAG 18 0 

GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA AATGGCAGAG 24 0 

TACGTANCAC CATTCCCACA 2 60 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . . 605 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 
CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAAC C A CTCCCTACAA TTTTTTGTAT 6 0 



TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT CATCAGCCTC 12 0 

CATCCAAGAT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT TTCCACTCGA 180 

GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC TTGANTCGAC 24 0 

GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA TATTCAGACC 3 00 

GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG GGNTTCATTT 3 60 

TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT CACGGTTTGA 42 0 

AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT GTTGGATATT 4 80 

GNATTC CAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG CTTTTACTTT 54 0 

CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG TTTNTTGCAC 6 00 

AATGT 605 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

AACACAAAAC TCTTCCATTT GGCTTCTCTC TTGCATATTC GTTGCAAGGA TGGGAAATTC 6 0 

AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN NATTCTATTG 12 0 

CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT GTGTCAAAAG 18 0 

GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG AAAACGGCTC 24 0 

TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG NAAAATCCAT 3 00 



GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG NTTCCATCAT 3 60 



TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN GGTTTCGGGG 42 0 
CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 4 64 



(2) INFORMATION FOR SEQ ID NO : 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

<ix) FEATURE: 

(A) NAME/KEY : CDS 

(B) LOCATION: 1..386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 41: 

TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANCCGTTTAC NGAATTTTAT 60 

GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT CAAGGATATG 12 0 

GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA GAGTTGGGGA 18 0 

GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG GGTTGATCCN 24 0 

ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC CTGAAGATAT 3 00 

CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT GTTGAATANT 3 60 

ATGCGTCAAG TGTGTAACTT ATGTGT 3 86 



(2) INFORMATION FOR SEQ ID NO : 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TACCATAGGG AGGTGGTGNA AG AT TTTGT A TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60 

TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 12 0 

TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180 

AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 24 0 

ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT AT C AAAAAT A AGAATAAATG 3 00 

ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 3 60 

AT AAAATGT C ATTTACC 3 77 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 377 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 

TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 

TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 

AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 

ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 

ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 
ATAAAATGTC ATTTACC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 346 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC CGAATCTTTT 6 0 
TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA GAGCTTCTNT 12 0 
GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA AGGAGTTTCC 18 0 
TGAATTGGGT TTAAAGAAGG AG AATTNTT C GGAGATGACT TGGTTTCANT CTGCTTTATG 24 0 
GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG ATCGGAATCT 3 00 
CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 34 6 

(2) INFORMATION FOR SEQ ID NO : 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .261 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA GGTTTGGTTA AAGGGAAGNT 60 

TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT CCTCTGTTTT GAGTCCTCAA 12 0 

TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA TTATAATAGA TAATGTAACT 18 0 

TTCTGCTACA ATTGTAAAAG TGAGATGTAC C C AATACGGT TTAAGCGGAC CGAGAATAGT 24 0 

CAATTCAAAG ACCAAATTCT G 2 61 



(2) INFORMATION FOR SEQ ID NO : 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..478 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 46: 

GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCAC CGANT CAACGAGGAC 6 0 

GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT GATTTCTTCC 12 0 

ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT CATCACTCCG 18 0 

GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG GG TAT AC AC A 24 0 

ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG GCTTAAAAAN 3 00 

CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA CATNTAAGAT 36 0 

GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACAAA NTTNGGGGGA 42 0 

ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC CGGCCGGT 47 8 

(2) INFORMATION FOR SEQ ID NO : 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG ATGATCTTTT 6 0 

CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT TCACTGGACT 12 0 

ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC CGGAACTAGG 18 0 

TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT GGTTTGCTGA 24 0 

TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAGCGTACAT CTCTATCTTT 3 00 

C AAAGG C AAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC AAGAGATATG 3 60 



GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC CATTTGGGTG 42 0 



NGGNAAAATG AGTGAAATGG CAGAGNCCGA 
ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG 
GNCNGTAAAN CNTGNAATGG GCCNAACCCG 



ACCACCAATT CCCACANNCG AGGGAGGGGA 4 80 
GNNNNNAAGN GGGNGCCAGN ACCAANCCGG 54 0 
TNCCGGATT 57 9 



(2) INFORMATION FOR SEQ ID NO : 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(B) STRAIN: Nipponbare, subsp . japonica 

(D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 

(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 252 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 48: 
TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC GTCGTCGGGA 6 0 



GGAACGTCGA CCAGGGCGCC GCNGACGTCG 

TCCCTCCCGA GCTCACCATA CGGGTGATCG 

TGTACCTCGG CTCGTGCGCC GACCTGGTGC 

GGATGACGAT TG 



TCGCCAGATG GCAAGACGTC GCGCCGAGCC 12 0 
TNCGAGGGCA GCGCGCCACG TTCCAGTCGC 18 0 
CGACGATGAG CAGCATGTTC CCGGAGCTCG 24 0 

252 



(2) INFORMATION FOR SEQ ID NO : 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) 



HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= Ambiguous 
/note= "Xaa = Cys or Ser" 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 20 . . 21 

(D) OTHER INFORMATION: /label= ambiguous 

/note= "Xaa-Xaa probably is Ser-Phe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 49: 

Thr Ser Thr Ser lie lie Asp Arg Phe Thr Gin Xaa Leu Asn Asn Arg 
15 10 15 

Ala Asp Pro Xaa Xaa 
20 



(2) INFORMATION FOR SEQ ID NO : 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= unknown 



(ix) FEATURE : 

(A) NAME /KEY : Modi fied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Trp" 

(ix) FEATURE: 

(A) NAME/ KEY : Modified- si te 

(B) LOCATION: 24 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Tyr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50: 

Xaa lie Xaa Val Xaa lie Glu Asp Glu Thr Ala Xaa Val Gin Ala Gly 
15 10 15 

Ala Thr Leu Gly Glu Val Tyr Xaa 
20 



(2) INFORMATION FOR SEQ ID NO : 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 51: 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gin Leu Tyr Tyr Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTTCTACTT CTATTATTGA TAGGTTTACT CA 3 2 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4 05 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ACT TCT ACT TCT ATT ATT GAT AGG TTT ACT CAA TGT CTA AAC AAC CGA 4 8 

Thr Ser Thr Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg 
15 10 15 



GCT GAC CCT TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC 96 
Ala Asp Pro Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn 
20 25 30 



TCC TCT TTT CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC 144 
Ser Ser Phe Pro Ser Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe 
35 40 45 



AAT GAA TCC ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC 192 
Asn Glu Ser Thr Thr Pro Lys Pro lie Leu lie lie Thr Ala Leu His 
50 55 60 



CCT TCA CAC ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG 24 0 
Pro Ser His lie Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu 



65 



70 



75 



80 



CTA ATG AAA ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT 2 88 
Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr 
85 90 95 

GTG ACC AAT TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA 33 6 
Val Thr Asn Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu 
100 105 110 

CGC TCC ATA AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCC 3 84 
Arg Ser lie Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala 
115 120 125 

GGC GCC ACC CTC GGA GAA GTT 4 05 

Gly Ala Thr Leu Gly Glu Val 
130 135 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Thr Ser Thr Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg 
15 10 15 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn 
20 25 30 

Ser Ser Phe Pro Ser Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe 
35 40 45 

Asn Glu Ser Thr Thr Pro Lys Pro lie Leu lie lie Thr Ala Leu His 
50 55 60 

Pro Ser His lie Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu 
65 70 75 80 

Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr 
85 90 95 

Val Thr Asn Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu 
100 105 110 



Arg Ser lie Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala 
115 120 125 



Gly Ala Thr Leu Gly Glu Val 
130 135 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACGTTTATG GAGCGTAAGT TGAAC 



(2) INFORMATION FOR SEQ ID NO : 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACCCTTCAC ACATTCAAGC AGC 



(2) INFORMATION FOR SEQ ID NO : 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 



(B) STRAIN: lollo bionda 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 7.. 1626 

(ix) FEATURE : 

(A) NAME/KEY: unsure 

(B) LOCATION: replace (372, M g") 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: replace (379, "g") 

(ix) FEATURE: 

(A) NAME /KEY : unsure 

(B) LOCATION: replace (786, "t") 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: replace ( 1105 .. 1106 , "ga") 

(D) OTHER INFORMATION: /note= "also possible "gg" and "aa IMI 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 4 8 

Met Ala lie Thr Tyr Ser Phe Asn Phe Lys Ser Tyr lie Phe 
15 10 

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96 
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr 
15 20 25 30 

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 14 4 
Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg Ala Asp Pro 
35 40 45 

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192 
Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn Ser Ser Phe 
50 55 60 

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 24 0 
Pro Ser Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe Asn Glu Ser 
65 70 75 

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 2 88 
Thr Thr Pro Lys Pro lie Leu lie lie Thr Ala Leu His Pro Ser His 
80 85 90 

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 33 6 
lie Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys 
95 100 105 110 

ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT GTG ACC AAT 3 84 



Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn 
115 120 125 



TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA CGC TCC ATA 432 
Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser lie 
130 135 140 

AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 4 80 
Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala Gly Ala Thr 
145 150 155 

CTT GGT GAA GTC TAC TAC CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 52 8 
Leu Gly Glu Val Tyr Tyr Arg lie Ala Glu Lys Ser Asn Ser His Ala 
160 165 170 

TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 576 
Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser 
175 180 185 190 

GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 62 4 
Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp 
195 200 205 

AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672 
Asn lie Val Asp Ala Gin Leu lie Asp Val Asn Gly Lys Leu Leu Asn 
210 215 220 

CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC ACA GGT GGT GGT 72 0 
Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly 
225 230 235 

GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 76 8 
Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys lie Lys Leu Val Arg 
240 245 250 

GTT CCT ACC ACT GTG ACC GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816 
Val Pro Thr Thr Val Thr Val Phe Asn Val Gin Arg Thr Ser Glu Gin 
255 260 265 270 

AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864 
Asn Leu Ser Thr lie Ala His Arg Trp lie Gin Val Ala Asp Lys Leu 
275 280 285 

GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912 
Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val lie Asn Asn Thr 
290 295 300 

AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 96 0 
Asn Gly Glu Lys Thr lie Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly 
305 310 315 

AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 10 0 8 
Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu 
320 325 330 



GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 1056 
Gly Val Glu lie Ser Asp Cys lie Glu Met Ser Trp lie Glu Ser Val 
335 340 345 350 



CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104 
Leu Phe Tyr Thr Asn Phe Pro lie Gly Thr Pro Thr Thr Ala Leu Leu 
355 360 365 



AGC CGT ACA CCT CAA AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 1152 
Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr 
370 375 380 



GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 12 0 0 
Val Lys Asn Thr lie Ser Lys Gin Gly Phe Glu Ser lie Phe Glu Arg 
385 390 395 



ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 124 8 
Met Lys Glu Leu Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly 
400 405 410 



AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 12 96 
Arg Met Ser Glu lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser 
415 420 425 430 



GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC 13 4 4 
Gly Asn lie Ala Lys lie Gin Tyr Glu Val Asn Trp Asp Glu Leu Gly 
435 440 445 



GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 13 92 
Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp 
450 455 460 



TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 144 0 
Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn 
465 470 475 



TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 14 8 8 
Tyr Arg Asp Leu Asp lie Gly Val Asn Ser His Gly Lys Asn Ala Tyr 
480 485 490 



GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 15 3 6 
Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr 
495 500 505 510 



AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 15 84 
Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe 
515 520 525 



AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 162 6 

Arg Asn Glu Gin Ser lie Pro Thr Leu Ser Ser Ser Trp Lys 
530 535 540 

TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG GTCATGGTAT 16 86 



CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA AATTATATTA 174 6 

TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT GATGTTGATA 18 06 

TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG ACTTCCCTTG 1866 

GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT CATTGGATGC 192 6 

AATTTATTAT TATAAACCTT ATTTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 1981 

(2) INFORMATION FOR SEQ ID NO : 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 0 amino acids, 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala lie Thr Tyr Ser Phe Asn Phe Lys Ser Tyr lie Phe Pro Leu 
15 10 15 

Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr Ser lie 
20 25 30 

lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg Ala Asp Pro Ser Phe 
35 40 45 

Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn Ser Ser Phe Pro Ser 
50 55 60 

Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe Asn Glu Ser Thr Thr 
65 70 75 80 

Pro Lys Pro lie Leu lie lie Thr Ala Leu His Pro Ser His lie Gin 
85 90 95 

Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys Thr Arg 
100 105 110 

Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn Ser Asn 
115 120 125 

Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser lie Asn Val 
130 135 140 

Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala Gly Ala Thr Leu Gly 
145 150 155 160 

Glu Val Tyr Tyr Arg lie Ala Glu Lys Ser Asn Ser His Ala Phe Pro 
165 170 175 



Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly 
180 185 190 



Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp Asn lie 
195 200 205 

Val Asp Ala Gin Leu lie Asp Val Asn Gly Lys Leu Leu Asn Arg Lys 
210 215 220 

Ser Met Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly Gly Val 
225 230 235 240 

Ser Phe Gly Val Val Val Ala Tyr Lys lie Lys Leu Val Arg Val Pro 
245 250 255 

Thr Thr Val Thr Val Phe Asn Val Gin Arg Thr Ser Glu Gin Asn Leu 
260 265 270 

Ser Thr lie Ala His Arg Trp lie Gin Val Ala Asp Lys Leu Asp Asn 
275 280 285 

Asp Leu Phe Leu Arg Met Thr Phe Asn Val lie Asn Asn Thr Asn Gly 
290 295 300 

Glu Lys Thr lie Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly Asn Ser 
305 310 315 320 

Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Val 
325 330 335 

Glu lie Ser Asp Cys lie Glu Met Ser Trp lie Glu Ser Val Leu Phe 
340 345 350 

Tyr Thr Asn Phe Pro lie Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg 
355 360 365 

Thr Pro Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr Val Lys 
370 375 380 

Asn Thr lie Ser Lys Gin Gly Phe Glu Ser lie Phe Glu Arg Met Lys 
385 390 395 400 

Glu Leu Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met 
405 410 415 

Ser Glu lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn 
420 425 430 

lie Ala Lys lie Gin Tyr Glu Val Asn Trp Asp Glu Leu Gly Val Glu 
435 440 445 



Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp Tyr Met 
450 455 460 



Thr Pro Phe Val 
465 

Asp Leu Asp lie 



Gly Met Val Tyr 
500 

Leu Thr Met Val 
515 

Glu Gin Ser lie 
530 



Ser Lys Asn Pro 
470 

Gly Val Asn Ser 
485 

Gly His Lys Tyr 



Lys Thr Arg Val 
520 

Pro Thr Leu Ser 
535 



Arg Glu Ala Phe 
475 

His Gly Lys Asn 
490 

Phe Lys Glu Thr 
505 

Asp Pro Ser Asn 



Ser Ser Trp Lys 
540 



Leu Asn Tyr Arg 
480 

Ala Tyr Gly Glu 
495 

Asn Tyr Lys Arg 
510 

Phe Phe Arg Asn 
525 



(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 59: 
GGTAATGATC TCCTTTCTTG TTTGACC 2 7 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
AGAGCGGCCG CTATATTACA ACTTCTCCAC CATCACTCCT C 41 



(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 61 
GGTGATGTTA ATGATAATCT CCTC 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 
AGAGCGGCCG CTACAATTCC TTCAACATGT AAATTTCCTC 



(2) INFORMATION FOR SEQ ID NO : 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 63 
ACTTCCCGTA GAAACTCGGA GACTTTCACA CAATGC 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TCCATCCAAG ATCAATTCAT AAACTGTGTC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGAGCGGCCG CTTTCATGAA CCTAGCTTCT AGTAGG 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGAGCGGCCG CGAAATGGCC CCCCTTTTAA AACGGGG 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGAGCGGCCG CAAATGATAT CTTCAGGTAA CTTTGTTCAC 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAGCGGCCG CATAATCAAA TAAATACACT TATGGTAACA CAG 



(2) INFORMATION FOR SEQ ID NO : 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 69: 
AGAGCGGCCG CTGGTTTTGT ATTGAGGACT CAAAACAG 



(2) INFORMATION FOR SEQ ID NO : 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: join (1 .. 570 , 801.. 1754) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 70: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 4 8 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 

15 10 15 

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 

Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 

20 25 30 

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 

Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 

35 40 45 

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 

50 55 60 

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 24 0 

Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 

65 70 75 80 

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA~ TAC 2 88 

Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 

85 90 95 

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336 

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 

100 105 110 

CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 

115 120 125 

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 4 32 

Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 

130 135 140 

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 4 80 

Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 

145 150 155 160 



CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 
His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 



528 



165 



170 



175 



ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 57 0 

Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn 
180 185 190 

GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG ATCGGATAGT 63 0 

GATTTTGGTC CATCAATTAA AAACTTGGTG AACATAAAAT TAACCAAGCA ATCAATTTAG 6 90 

ACAAGCAACA TAATCATATA TATTTTTCTT ACATTTGTAT GTACCTGAAT ATTTATATTT 75 0 

ATGTTTATAT GTTCTCACTA TATTTTCACT TTTGTATTTG AAAATTTTTA GGA AAA 8 06 

Gly Lys 

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 854 
lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 
195 200 205 

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 902 
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 
210 215 220 

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 950 
Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr 
225 230 235 240 

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 998 
Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 
245 250 255 

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 104 6 
Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 
260 265 270 

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 10 94 
Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 
275 280 285 

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 1142 
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 
290 295 300 

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 1190 
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 12 3 8 
lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 



ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 
Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 



1286 



TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 
Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 



1334 



TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 13 82 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
370 375 380 

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 14 3 0 

Tyr Gly Gly Glu Met Gly Arg lie Ser Leu Arg Val Thr Pro Phe Pro 
385 390 395 400 

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 14 7 8 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 

405 410 415 

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 15 2 6 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 15 74 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 162 2 

Asn Asn Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 16 7 0 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1718 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 

485 490 495 

TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 175 7 

Phe Phe Arg Asn Glu His Ser lie Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 71: 



Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 
15 10 15 



Ser Asp Pro Lys 
20 

Gly Ser Tyr Ser 
35 

Asn Thr Thr Ser 
50 

Glu Ser His Val 
65 

Gin Met Lys lie 



Val Thr Tyr Ser 
100 

Arg Ser Val Asp 
115 

Gly Ala lie Leu 
130 

Thr Leu Ala Tyr 
145 

His lie Ser Gly 



Thr Val Asp Asn 
180 

lie Leu Asp Arg 
195 

Gly Gly Gly Gly 
210 

Leu Val Glu Val 
225 

Leu Glu Gin Asn 



Pro Lys Leu Pro 
260 

Asn Gly Thr Val 
275 

Met Phe Leu Gly 
290 



His Pro lie Ser 



Ser Val Leu Gin 
40 

Thr Pro Lys Pro 
55 

Gin Ala Ala lie 
70 

Arg Ser Gly Gly 
85 

Gly Lys Pro Phe 



Val Asp Val Ala 
120 

Gly Glu Val Tyr 
135 

Pro Ala Gly lie 
150 

Gly Gly Tyr Gly 
165 

Thr lie Asp Ala 



Lys Leu Met Gly 
200 

Gly Ser Tyr Gly 
215 

Pro Glu Asn Val 
230 

Ala Thr Asp lie 
245 

Asp Glu Leu Phe 



Ser Ser Gin Lys 
280 

Asp Thr Thr Thr 
295 



Pro Ala lie Phe 
25 

Ala Asn lie Arg 



Phe Leu lie lie 
60 

Thr Cys Gly Lys 
75 

His Asp Tyr Asp 
90 

Phe Val Leu Asp 
105 

Ser Lys Thr Ala 



Tyr Tyr lie Trp 
140 

Cys Pro Thr Val 
155 

Asn Met Met Arg 
170 

Arg Met Val Asp 
185 

Glu Asp Leu Tyr 



Val Val Leu Ala 
220 

Thr Val Phe Arg 
235 

lie His Arg Trp 
250 

lie Arg Thr Val 
265 

Thr Val Arg Thr 



Leu Leu Ser lie 
300 



Phe Ser Gly Asn 
30 

Asn Leu Arg Phe 
45 

Ala Ala Thr His 



Arg His Asn Leu 
80 

Gly Leu Ser Tyr 
95 

Met Phe Asn Leu 
110 

Trp Val Gin Thr 
125 

Glu Lys Ser Lys 



Gly Val Gly Gly 
160 

Lys Tyr Gly Leu 
175 

Val Asn Gly Lys 
190 

Trp Ala lie Asn 
205 

Tyr Lys lie Asn 



lie Ser Arg Thr 
240 

Gin Gin Val Ala 
255 

lie Asp Val Val 
270 

Thr Phe lie Ala 
285 

Leu Asn Arg Arg 



Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 



lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 

Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 

Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
370 375 380 

Tyr Gly Gly Glu Met Gly Arg lie Ser Leu Arg Val Thr Pro Phe Pro 
385 390 395 400 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

Asn Asn Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 

Phe Phe Arg Asn Glu His Ser lie Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 



(ix) FEATURE: 

(A) NAME/ ICE Y : CDS 

(B) LOCATION: 1..1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 4 8 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 

15 10 15 

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 

Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 

20 25 30 

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 

Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 
50 55 60 

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 24 0 

Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 

65 70 75 80 

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 2 88 

Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 

85 90 95 

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 33 6 

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 

100 105 110 

CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 43 2 

Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
130 135 140 

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 4 80 

Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 

145 150 155 160 

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 52 8 

His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 

165 170 175 



ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 576 

Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn Gly Lys 
180 185 190 

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 624 

lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 
195 200 205 

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 672 

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 
210 215 220 

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 72 0 

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr 

225 230 235 240 

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 768 

Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 
245 250 255 

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 816 

Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 
260 265 270 

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 8 64 

Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 
275 280 285 

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 912 

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 
290 295 300 

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 96 0 

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 

305 310 315 320 

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 100 8 

lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 105 6 

Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 

TCA GAT TAC GTA CGT GAA CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 1104 

Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 

TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 1152 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
370 375 380 

TAC GGT GGT GAG ATG GGG AGG ATA TCA TCT ACG GTG ACT CCG TTC CCA 12 0 0 

Tyr Gly Gly Glu Met Gly Arg lie Ser Ser Thr Val Thr Pro Phe Pro 



385 



390 



395 



400 



TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 12 4 8 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 

405 410 415 

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 12 96 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 1344 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

AAT TAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 13 92 

Asn Tyr Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 14 4 0 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 14 8 8 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 

485 490 495 

TTC TTT AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 152 7 

Phe Phe Arg Asn Glu Gin Ser lie Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 
1 5 10 15 

Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
20 25 30 

Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 
50 55 60 



Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 
65 70 75 80 

Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 
85 90 95 

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
130 135 140 

Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 
145 150 155 160 

His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 
165 170 175 

Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn Gly Lys 
180 185 190 

lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 
195 200 205 

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 
210 215 220 

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr 
225 230 235 240 

Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 
245 250 255 

Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 
260 265 270 

Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 
275 280 285 

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 
290 295 300 

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 

lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 

Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 



Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 



Met Ala Phe Asn Pro 
380 



Val Thr Pro Phe Pro 
400 

Gly Ala Asn Trp Arg 
415 

Thr Arg Lys Leu Tyr 
430 

Arg Gin Ser Phe Phe 
445 

His Asn Gly Lys lie 
460 

Lys Tyr Phe Ala Gly 
480 

Val Asp Ser Gly Asn 
495 

Pro 

500 505 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1527 



Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr 
370 375 



Tyr Gly Gly Glu Met Gly Arg lie Ser Ser Thr 
385 390 395 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr 
405 410 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu 
420 425 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro 
435 440 

Asn Tyr Arg Asp Val Asp Leu Gly lie Asn Ser 
450 455 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys 
465 470 475 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg 
485 490 

Phe Phe Arg Asn Glu Gin Ser lie Pro Val Leu 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 4 8 

Ser lie Gin Asp Gin Phe lie Asn Cys Val Lys Arg Asn Thr His Val 
15 10 15 

TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 96 
Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser 
20 25 30 



TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 144 
Leu Phe Asn Gin Val Leu Glu Ser Thr Ala Gin Asn Leu Gin Phe Leu 
35 40 45 

GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 192 
Ala Lys Ser Met Pro Lys Pro Gly Phe lie Phe Arg Pro lie His Gin 
50 55 60 

TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 24 0 
Ser Gin Val Gin Ala Ser lie lie Cys Ser Lys Lys Leu Gly lie His 
65 70 75 80 

TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 288 
Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val 
85 90 95 

TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 336 
Ser Arg lie Glu Lys Pro Phe lie Leu Leu Asp Leu Ser Lys Leu Lys 
100 105 110 

CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 3 84 
Gin lie Asn Val Asp lie Glu Ser Asn Ser Ala Trp Val Gin Pro Gly 
115 120 125 

GCT, ACG CTT GGT GAG CTT TAC TAC AGA ATT GCA GAG AAG AGC AAG ATC 4 32 
Ala Thr Leu Gly Glu Leu Tyr Tyr Arg lie Ala Glu Lys Ser Lys lie 
130 135 140 

CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 4 80 
His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly lie Gly Gly Tyr 
145 150 155 160 

ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 52 8 
Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala 
165 170 175 

GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 576 
Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu 
180 185 190 

CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 62 4 
Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala lie Arg Gly 
195 200 205 



GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 6 72 
Gly Gly Gly Ala Ser Phe Gly lie Val Leu Ala Trp Lys lie Lys Leu 
210 215 220 

GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 72 0 
Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu 
225 230 235 240 

GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 7 68 
Glu Gin Asp Ala Arg Leu Lys Thr lie Ser Lys Trp Gin Gin lie Ser 
245 250 255 



TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816 

Ser Lys lie lie Glu Glu lie His lie Arg Val Val Leu Arg Ala Ala 
260 265 270 

GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 8 64 

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gin Phe 
275 280 285 

CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912 

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro 
290 295 300 

GAA CTA GGG TTA ACT CAA AAG GAT TGT ACT GAA ATG AGC TGG ATT GAA 960 

Glu Leu Gly Leu Thr Gin Lys Asp Cys Thr Glu Met Ser Trp He Glu 
305 310 315 320 

GCC GCC CTT TTC CAT GGT GGA TTT CCA ACA GGT TCT CCT ATT GAA ATT 10 08 

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro He Glu He 

325 330 335 

TTG CTT CAG CTC AAG TCG CCT CTA GGA AAA GAT TAC TTC AAA GCA ACG 1056 

Leu Leu Gin Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr 
340 345 350 

TCG GAT TTC GTT AAA GAA CCT ATT CCT GTG ATA GGC TTC AAA GGA ATA 1104 

Ser Asp Phe Val Lys Glu Pro He Pro Val He Gly Phe Lys Gly He 
355 360 365 

TTC AAA AGA TTG ATT GAA GGA AAC ACA ACA TTT CTG AAC TGG ACT CCT 1152 

Phe Lys Arg Leu He Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro 
370 375 380 

TAC GGT GGT ATG ATG TCG AAA ATC CCT GAA TCT GCG ATC CCA TTT CCG 12 00 

Tyr Gly Gly Met Met Ser Lys He Pro Glu Ser Ala He Pro Phe Pro 
385 390 395 400 

CAT AGA AAC GGA ACC CTC TTC AAG ATT CTC TAT TAC GCG AAC TGG CTA 12 4 8 

His Arg Asn Gly Thr Leu Phe Lys He Leu Tyr Tyr Ala Asn Trp Leu 

405 410 415 



GAG AAT GAC AAG ACA TCG AGT AGA AAA ATC AAC TGG ATC AAA GAG ATA 12 96 
Glu Asn Asp Lys Thr Ser Ser Arg Lys lie Asn Trp lie Lys Glu lie 
420 425 430 



TAC AAT TAC ATG GCG CCT TAT GTC TCA AGC AAT CCA AGA CAA GCA TAT 1344 
Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gin Ala Tyr 
435 440 445 



GTG AAC TAC AGA GAT CTA GAC TTC GGA CAG AAC AAG AAC AAC GCA AAG 13 92 

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gin Asn Lys Asn Asn Ala Lys 

450 455 460 

GTT AAC TTC ATT GAA GCT AAA ATC TGG GGA CCT AAG TAC TTC AAA GGC 144 0 

Val Asn Phe lie Glu Ala Lys He Trp Gly Pro Lys Tyr Phe Lys Gly 
465 470 475 480 



AAT TTT GAC AGA TTG GTG AAG ATT AAA ACC AAG GTT GAT CCA GAG AAC 14 8 8 
Asn Phe Asp Arg Leu Val Lys He Lys Thr Lys Val Asp Pro Glu Asn 
485 490 495 

TTC TTC AGG CAC GAG CAG AGT ATC CCA CCT ATG CCC TAC TAG 15 3 0 
Phe Phe Arg His Glu Gin Ser He Pro Pro Met Pro Tyr 
500 505 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 09 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser He Gin Asp Gin Phe He Asn Cys Val Lys Arg Asn Thr His Val 
15 10 15 

Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser 
20 25 30 

Leu Phe Asn Gin Val Leu Glu Ser Thr Ala Gin Asn Leu Gin Phe Leu 
,35 40 45 

Ala Lys Ser Met Pro Lys Pro Gly Phe He Phe Arg Pro He His Gin 
50 55 60 

Ser Gin Val Gin Ala Ser He He Cys Ser Lys Lys Leu Gly He His 
65 70 75 80 

Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val 
85 90 95 



Ser Arg He Glu Lys Pro Phe He Leu Leu Asp Leu Ser Lys Leu Lys 



100 



105 



110 



Gin He Asn Val Asp He Glu Ser Asn Ser Ala Trp Val Gin Pro Gly 
115 120 125 

Ala Thr Leu Gly Glu Leu Tyr Tyr Arg He Ala Glu Lys Ser Lys He 
130 135 140 

His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly lie Gly Gly Tyr 
145 150 155 160 

Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala 
165 1 170 175 

Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu 
180 185 190 

Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala He Arg Gly 
195 200 205 

Gly Gly Gly Ala Ser Phe Gly He Val Leu Ala Trp Lys He Lys Leu 
210 215 220 

Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu 
225 230 235 240 

Glu Gin Asp Ala Arg Leu Lys Thr He Ser Lys Trp Gin Gin He Ser 
245 250 255 

Ser Lys He He Glu Glu He His He Arg Val Val Leu Arg Ala Ala 
260 265 270 

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gin Phe 
275 280 285 

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro 
290 295 300 

Glu Leu Gly Leu Thr Gin Lys Asp Cys Thr Glu Met Ser Trp He Glu 
305 310 315 320 

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro He Glu He 
325 330 335 

Leu Leu Gin Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr 
340 345 350 

Ser Asp Phe Val Lys Glu Pro He Pro Val He Gly Phe Lys Gly He 
355 360 365 

Phe Lys Arg Leu He Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro 
370 375 380 



Tyr Gly Gly Met Met Ser Lys He Pro Glu Ser Ala He Pro Phe Pro 



385 



390 



395 



400 



His Arg Asn Gly Thr Leu Phe Lys lie Leu Tyr Tyr Ala Asn Trp Leu 
405 410 415 

Glu Asn Asp Lys Thr Ser Ser Arg Lys lie Asn Trp lie Lys Glu lie 
420 425 430 

Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gin Ala Tyr 
435 440 445 

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gin Asn Lys Asn Asn Ala Lys 
450 455 460 

Val Asn Phe lie Glu Ala Lys lie Trp Gly Pro Lys Tyr Phe Lys Gly 
465 470 475 480 

Asn Phe Asp Arg Leu Val Lys lie Lys Thr Lys Val Asp Pro Glu Asn 
485 490 495 

Phe Phe Arg His Glu Gin Ser lie Pro Pro Met Pro Tyr 
500 505 

(2) INFORMATION FOR SEQ ID NO : 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



Met Glu Asn Lys Thr Pro lie Phe Phe Ser Leu Ser lie Phe Leu Ser 
15 10 15 



Leu Leu Asn Cys Ala Leu Gly Gly Asn Asp Leu Leu Ser Cys Leu Thr 
20 25 30 



Phe Asn Gly Val Arg Asn His Thr Val Phe Ser Ala Asp Ser Asp Ser 
35 40 45 



Asp Phe Asn Arg Phe Leu His Leu Ser lie Gin Asn Pro Leu Phe Gin 
50 55 60 



Asn Ser Leu lie Ser Lys Pro Ser Ala lie lie Leu Pro Gly Ser Lys 
65 70 75 80 



Glu Glu Leu Ser Asn Thr lie Arg Cys lie Arg Lys Gly Ser Trp Thr 
85 90 95 



lie Arg Leu Arg Ser Gly Gly His Ser Tyr Glu Gly Leu Ser Tyr Thr 
100 105 110 



Ser Asp Thr Pro Phe lie Leu lie Asp Leu Met Asn Leu Asn Arg Val 
115 120 125 



Ser lie Asp Leu Glu Ser Glu Thr Ala Trp Val Glu Ser Gly Ser Thr 
130 135 140 



Leu Gly Glu Leu Tyr Tyr Ala He Thr Glu Ser Ser Ser Lys Leu Gly 
145 150 155 160 



Phe Thr Ala Gly Trp Cys Pro Thr Val Gly Thr Gly Gly His He Ser 
165 170 175 



Gly Gly Gly Phe Gly Met Met Ser Arg Lys Tyr Gly Leu Ala Ala Asp 

180 185 190 

Asn Val Val Asp Ala He Leu He Asp Ala Asn Gly Ala He Leu Asp 

195 200 205 



Arg Gin Ala Met Gly Glu Asp Val Phe Trp Ala He Arg Gly Gly Gly 
210 215 220 



Gly Gly Val Trp Gly Ala He Tyr Ala Trp Lys He Lys Leu Leu Pro 
225 230 235 240 



Val Pro Glu Lys Val Thr Val Phe Arg Val Thr Lys Asn Val Ala He 
245 250 255 



Asp Glu Ala Thr Ser Leu Leu His Lys Trp Gin Phe Val Ala Glu Glu 
260 265 270 



Leu Glu Glu Asp Phe Thr Leu Ser Val Leu Gly Gly Ala Asp Glu Lys 
275 280 285 



Gin Val Trp Leu Thr Met Leu Gly Phe His Phe Gly Leu Lys Thr Val 
290 295 300 



Ala Lys Ser Thr Phe Asp Leu Leu Phe Pro Glu Leu Gly Leu Val Glu 
305 310 315 320 



Glu Asp Tyr Leu Glu Met Ser Trp Gly Glu Ser Phe Ala Tyr Leu Ala 
325 330 335 



Gly Leu Glu Thr Val Ser Gin Leu Asn Asn Arg Phe Leu Lys Phe Asp 
340 345 350 



Glu Arg Ala Phe Lys Thr Lys Val Asp Leu Thr Lys Glu Pro Leu Pro 
355 360 365 



Ser Lys Ala Phe Tyr Gly Gly Leu Leu Glu Arg Leu Ser Lys Glu Pro 
370 375 380 



Asn Gly Phe lie Ala Leu Asn Gly Phe Gly Gly Gin Met Ser Lys lie 
385 390 395 400 



Ser Ser Asp Phe Thr Pro Phe Pro His Arg Ser Gly Thr Arg Leu Met 
405 410 415 



Val Glu Tyr lie Val Ala Trp Asn Gin Ser Glu Gin Lys Lys Lys Thr 
420 425 430 



Glu Phe Leu Asp Trp Leu Glu Lys Val Tyr Glu Phe Met Lys Pro Phe 
435 440 445 



Val Ser Lys Asn Pro Arg Leu Gly Tyr Val Asn His lie Asp Leu Asp 
450 455 460 



Leu Gly Gly lie Asp Trp Gly Asn Lys Thr Val Val Asn Asn Ala lie 
465 470 475 480 



Glu lie Ser Arg Ser Trp Gly Glu Ser Tyr Phe Leu Ser Asn Tyr Glu 
485 490 495 



Arg Leu lie Arg Ala Lys Thr Leu lie Asp Pro Asn Asn Val Phe Asn 
500 505 510 



His Pro Gin Ser lie Pro Pro Met Ala Asn Phe Asp Tyr Leu Glu Lys 



515 



520 



525 



Thr Leu Gly Ser Asp Gly Gly Glu Val Val lie 
530 535 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Met Met Cys Arg Ser Leu Thr Leu Arg Phe Phe Leu Phe lie Val Leu 
15 10 15 



Leu Gin Thr Cys Val Arg Gly Gly Asp Val Asn Asp Asn Leu Leu Ser 
20 25 30 



Ser Cys Leu Asn Ser His Gly Val His Asn Phe Thr Thr Leu Ser Thr 
35 40 45 



Asp Thr Asn Ser Asp Tyr Phe Lys Leu Leu His Ala Ser Met Gin Asn 
50 55 60 



Pro Leu Phe Ala Lys Pro Thr Val Ser Lys Pro Ser Phe lie Val Met 
65 70 75 80 



Pro Gly Ser Lys Glu Glu Leu Ser Ser Thr Val His Cys Cys Thr Arg 
85 90 95 



Glu Ser Trp Thr lie Arg Leu Arg Ser Gly Gly His Ser Tyr Glu Gly 
100 105 110 



Leu Ser Tyr Thr Ala Asp Thr Pro Phe Val lie Val Asp Met Met Asn 
115 120 125 



Leu Asn Arg lie Ser lie Asp Val Leu Ser Glu Thr Ala Trp Val Glu 



130 135 140 



Ser Gly Ala Thr Leu Gly Glu Leu Tyr Tyr Ala lie Ala Gin Ser Thr 
145 150 155 160 



Asp Thr Leu Gly Phe Thr Ala Gly Trp Cys Pro Thr Val Gly Ser Gly 
165 170 175 



Gly His lie Ser Gly Gly Gly Phe Gly Met Met Ser Arg Lys Tyr Gly 
180 185 190 



Leu Ala Ala Asp Asn Val Val Asp Ala lie Leu lie Asp Ser Asn Gly 
195 200 205 



Ala lie Leu Asp Arg Glu Lys Met Gly Asp Asp Val Phe Trp Ala lie 
210 215 220 



Arg Gly Gly Gly Gly Gly Val Trp Gly Ala lie Tyr Ala Trp Lys lie 
225 230 235 240 



Lys Leu Leu Pro Val Pro Glu Lys Leu Thr Val Phe Arg Val Thr Lys 
245 250 255 



Asn Val Gly lie Glu Asp Ala Ser Ser Leu Leu His Lys Trp Gin Tyr 
260 265 270 



Val Ala Asp Glu Leu Asp Glu Asp Phe Thr Val Ser Val Leu Gly Gly 
275 280 285 



Val Asn Gly Asn Asp Ala Trp Leu Met Phe Leu Gly Leu His Leu Gly 
290 295 300 



Arg Lys Asp Ala Ala Lys Thr lie lie Asp Glu Lys Phe Pro Glu Leu 
305 310 315 320 



Gly Leu Val Asp Lys Glu Phe Gin Glu Met Ser Trp Gly Glu Ser Met 
325 330 335 



Ala Phe Leu Ser Gly Leu Asp Thr lie Ser Glu Leu Asn Asn Arg Phe 
340 345 350 



Leu Lys Phe Asp Glu Arg Ala Phe Lys Thr Lys Val Asp Phe Thr Lys 
355 360 365 



Val Ser Val Pro 
370 



Ser Glu Gin Pro 
385 



Met Ser Glu lie 



Thr Lys Leu Met 
420 



Ser Lys lie Gly 
435 



Leu Glu Pro Phe 
450 



lie Asp Leu Asp 
465 

Thr Asn Ala Val 



Ser Asn Tyr Glu 
500 



Asn Val Phe Asn 
515 



Leu Asn Val Phe 
375 



Gly Gly Phe lie 
390 



Ser Thr Asp Phe 
405 



Phe Glu Tyr lie 



Glu Phe Ser Glu 
440 



Val Ser Lys Glu 
455 



lie Gly Gly lie 
470 

Glu lie Ala Arg 
485 



Arg Leu Val Lys 



His Pro Gin Ser 
520 



Arg His His Ala 
380 



Ala Leu Asn Gly 
395 



Thr Pro Phe Pro 
410 



lie Ala Trp Asn 
425 



Trp Leu Ala Lys 



Pro Arg Val Gly 
460 



Asp Trp Arg Asn 
475 

Asn Trp Gly Glu 
490 



Ala Lys Thr Leu 
505 



lie Pro Pro Met 



Leu Glu Met Leu 



Phe Gly Gly Lys 
400 



His Arg Lys Gly 
415 



Gin Asp Glu Glu 
430 



Phe Tyr Asp Tyr 
445 



Tyr Val Asn His 



Lys Ser Ser Thr 
480 

Arg Tyr Phe Ser 
495 



lie Asp Pro Asn 
510 



Met Lys Phe Glu 
525 



Glu lie Tyr Met Leu Lys Glu Leu 
530 535 



REMARKS 



The Official Action of July 15, 2003 has been carefully considered and 
reconsideration of the application in view of the present submission is respectfully 
requested. 

The indication that claims 51-53, 55, 59 and 60 are allowed has been noted 
with appreciation. To remove the basis for the objection to the specification, 
Applicants have amended the specification to include in the Sequence Listing SEQ ID 
NOS: 76 and 77, which correspond to the respective sequences of the berberine bridge 
enzymes EcBBE and PsBBE shown in Fig. 10 of the drawings. Applicants 
respectfully note that the present application claims the benefit of a parent PCT 
application which was filed prior to July 1, 1998 and that the revised sequence may 
thus comply with the sequence rules then in effect (see MPEP Section 2420 and 63 
FR 29634 (June 1, 1998)). 

Applicants submit herewith a computer readable form copy and a paper copy 
of the Sequence Listing along with the requisite statements that the computer readable 
form copy and paper copy are the same and that they contain no new matter. 
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In view of the above, it is respectfully submitted that the sole remaining 
objection to the application has been overcome and that the application is now in 
allowable form. An early notice of allowance is earnestly solicited and is believed to 



be fully warranted. 




\ 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: STUIVER, Maarten Hendrik 

CUSTERS, Jerome Humbertina Henricus Victor 
SELA-BURLAGE , Marianne Beatrix 
MELCHERS , Leo Sjoerd 

VAN DE VENTER - TROOST , Johanna Pieternella 
LAGEWEG, Wessel 
PONSTEIN, Anne Silene 
LAGEWEG, Wessel 
PONSTEIN, Anne Silene 

(ii) TITLE OF INVENTION: ANTIFUNGAL PROTEINS, DNA CODING THEREFORE, 

AND HOSTS INCORPORATING SAME. 

(iii) NUMBER OF SEQUENCES: 77 

a. 

(iv) CORRESPONDENCE ADDRESS: (A) 

(B) 
(C) 
(D) 
(E) 
(F) 

(v) COMPUTER READABLE FORM: (A) 

(B) 
(O 
(D) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/258,031 

(B) FILING DATE: 25-FEB-1999 

(C) CLASSIFICATION: 43 5 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP97/ 04 92 3 

(B) FILING DATE: 04-SEP-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP97200831.2 

(B) FILING DATE: 19-MAR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: EP96202466.7 

(B) FILING DATE: 04-SEP-1996 

(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



ADDRESSEE: LADAS & PARRY 

STREET 2 6 WEST 61 STREET 

CITY NEW YORK 

NY 

USA 

10023 - 7604 

MEDIUM TYPE: 3.25" Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: WINDOWS 95 
SOFTWARE: WORDPERFECT 8 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv. zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

Ser lie Asn Val Asp lie Glu Gin Glu Thr Ala Trp Val Gin Ala Gly 
15 10 15 

Ala Thr Leu Gly Glu Val Tyr Tyr Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO : 2 : . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv . zebulon 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr Pro Gly Xaa Ser 
15 10 15 

Ser Phe Pro Thr Val Leu Gin Asn Tyr 
20 25 



(2) INFORMATION FOR SEQ ID NO : 3: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE : 

(A) NAME/ KEY : misc_feature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
AACTTCTCCN AGNGTNGCNC CNGCTTGNAC CCA 3 3 



(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/ KEY : mi s c_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GATCCNTCTT TCCCNATTAC TGGNGAGGTT TA 3 2 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: cv . zebulon 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 354 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 



GAT CCG TCT TTC CCG ATT ACT GGG GAG GTT TAC ACT CCC GGA AAC TCA 4 8 

Asp Pro Ser Phe Pro He Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser 
1 5 10 15 

TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT 96 
Ser Phe Pro Thr Val Leu Gin Asn Tyr He Arg Asn Leu Arg Phe Asn 
20 25 30 

GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT 144 
Glu Thr Thr Thr Pro Lys Pro Phe Leu He He Thr Ala Glu His Val 
35 40 45 

TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA 192 
Ser His He Gin Ala Ala Val Val Cys Gly Lys Gin Asn Arg Leu Leu 
50 55 60 

CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT 24 0 
Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu 
65 70 75 80 

ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG 288 
Thr Asn Thr Asn Gin Pro Phe Phe He Val Asp Met Phe Asn Leu Arg 
85 90 95 

TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG GTC CAA GCC GGC 
336 

Ser lie Asn Val Asp He Glu Gin Glu Thr Ala Trp Val Gin Ala Gly 

100 105 110 

GCC ACC CTC GGA GAA GTT 354 
Ala Thr Leu Gly Glu Val 
115 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 



Asp Pro Ser Phe 
1 

Ser Phe Pro Thr 
20 

Glu Thr Thr Thr 
35 

Ser His lie Gin 
50 

Leu Lys Thr Arg 
65 

Thr Asn Thr Asn 



Ser lie Asn Val 
100 

Ala Thr Leu Gly 
115 



Pro lie Thr Gly 

5 

Val Leu Gin Asn 



Pro Lys Pro Phe 
40 

Ala Ala Val Val 
55 

Ser Gly Gly His 
70 

Gin Pro Phe Phe 
85 

Asp lie Glu Gin 



Glu Val 



Glu Val Tyr Thr 
10 

Tyr lie Arg Asn 
25 

Leu lie lie Thr 



Cys Gly Lys Gin 
60 

Asp Tyr Glu Gly 
75 

lie Val Asp Met 
90 

Glu Thr Ala Trp 
105 



Pro Gly Asn Ser 
15 

Leu Arg Phe Asn 
30 

Ala Glu His Val 
45 

Asn Arg Leu Leu 



Leu Ser Tyr Leu 
80 

Phe Asn Leu Arg 
95 

Val Gin Ala Gly 
110 



(2) INFORMATION FOR SEQ ID NO : 7: 

(i) ^ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function= "primer" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
CAGGCAGCTG TGGTTTGTGG C 



(2) INFORMATION FOR SEQ ID NO : 8: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDMA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : miscjeature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GTCCACAATG AAGAAGGGTT G 



(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /function; 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : : 
ACGTAGATAT CGAACAAGAA ACCGC 



(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
GCTTTACTAC ACGGGCTTCC CCAG 



(2) INFORMATION FOR SEQ ID NO : 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11 
CTGGGGAAGC CCGTGTAGTA AAGC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTACTCCAA CCACGGCGCT C 



(2) INFORMATION FOR SEQ ID NO : 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGGAAGTTG CAGAAGATTG GGTTG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAGCAAGAGA AGAAGGAGAC 2 0 

(2) INFORMATION FOR SEQ ID NO : 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 21.. 1608 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



ATATCACATC TTCTTTCAAC ATG CAA ACT TCC ATT CTT ACT CTC CTT CTT 



50 



Met Gin Thr Ser lie Leu Thr Leu Leu Leu 
15 10 



CTC TTG CTC TCA ACC CAA TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT 98 
Leu Leu Leu Ser Thr Gin Ser Ser Ala Thr Ser Arg Ser lie Thr Asp 
15 20 25 

CGC TTC ATT CAA TGT TTA CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA 14 6 
Arg Phe lie Gin Cys Leu His Asp Arg Ala Asp Pro Ser Phe Pro lie 
30 35 40 

ACC GGA GAG GTT TAC ACT CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG 194 
Thr Gly Glu Val Tyr Thr Pro Gly Asn Ser Ser Phe Pro Thr Val Leu 
45 50 55 

CAA AAC TAC ATC CGA AAC CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA 24 2 
Gin Asn Tyr lie Arg Asn Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys 
60 65 70 

CCC TTT TTA ATC ATC ACA GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT 2 90 
Pro Phe Leu lie lie Thr Ala Glu His Val Ser His lie Gin Ala Ala 
75 80 85 90 

GTG GTT TGT GGC AAA CAA AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT 3 38 
Val Val Cys Gly Lys Gin Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly 

95 100 105 

GGT CAT GAT TAT GAA GGT CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC 3 86 
Gly His Asp Tyr Glu Gly Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro 
110 115 120 

TTC TTC ATT GTG GAC ATG TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC 4 34 
Phe Phe lie Val Asp Met Phe Asn Leu Arg Ser lie Asn Val Asp lie 
125 130 135 

GAA CAA GAA ACC GCA TGG GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG 4 82 

Glu Gin Glu Thr Ala Trp Val Gin Ala Gly Ala Thr Leu Gly Glu Val 
140 145 150 

TAC TAT CGA ATA GCG GAG AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG 53 0 

Tyr Tyr Arg lie Ala Glu Lys Ser Asn Lys His Gly Phe Pro Ala Gly 
155 160 165 170 

GTT TGT CCA ACG GTT GGC GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT 57 8 

Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly Gly Tyr 
175 180 185 

GGT AAT TTG ATG AGA AAA TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT 62 6 

Gly Asn Leu Met Arg Lys Tyr Gly Leu Ser Val Asp Asn lie Val Asp 
190 195 200 

GCT CAA ATA ATA GAT GTG AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG 674 
Ala Gin He He Asp Val Asn Gly Lys Leu Leu Asp Arg Lys Ser Met 
205 210 215 



GGT GAG GAT TTG TTT TGG GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT 72 2 

Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly Gly Val Ser Phe 
220 225 230 

GGT GTG GTT CTA GCC TAC AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT 77 0 

Gly Val Val Leu Ala Tyr Lys lie Lys Leu Val Arg Val Pro Glu Val 
235 240 245 250 

GTG ACC GTG TTT ACC ATT GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC 818 
Val Thr Val Phe Thr lie Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr 
255 260 265 

ATC GCG GAA CGA TGG GTA CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT 8 66 

lie Ala Glu Arg Trp Val Gin Val Ala Asp Lys Leu Asp Arg Asp Leu 
270 275 280 

TTC CTT CGA ATG ACC TTT AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG 914 
Phe Leu Arg Met Thr Phe Ser Val lie Asn Asp Thr Asn Gly Gly Lys 
285 290 295 

ACA GTC CGT GCT ATC TTT CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT 962 
Thr Val Arg Ala lie Phe Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn 
300 305 310 

CTT GTT ACA CTT TTG AAT AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA 1010 
Leu Val Thr Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu 
315 320 325 330 

TCG GAT TGT ACT GAA ATG AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG 10 5 8 
Ser Asp Cys Thr Glu Met Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr 
335 340 345 

GGC TTC CCC AGT GGT ACT CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT 1106 
Gly Phe Pro Ser Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro 
350 355 360 

CAA AGA CTC AAC CCA TTC AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT 1154 
Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr Val Gin Asn Pro 
365 370 375 

ATT TCT AAA CGA CAG TTC GAG TTC ATC TTC GAA AGG CTG AAA GAA CTT 12 02 
lie Ser Lys Arg Gin Phe Glu Phe lie Phe Glu Arg Leu Lys Glu Leu 
380 385 390 

GAA AAC CAA ATG TTG GCT TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA 1250 
Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu 
395 400 405 410 

ATA TCC GAA TTC GCA AAG CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG 12 98 
lie Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn lie Ala 
415 420 425 

AAA ATT CAA TAC GAA GTA AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA 13 4 6 
Lys lie Gin Tyr Glu Val Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu 



430 



435 



440 



AAT CGT TAC TTG AAT TTC ACA AGG CTG ATG TAT GAT TAC ATG ACC CCA 13 94 
Asn Arg Tyr Leu Asn Phe Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro 
445 450 455 

TTT GTG TCG AAA AAC CCT AGA AAA GCA TTT TTG AAC TAT AGG GAT TTG 144 2 
Phe Val Ser Lys Asn Pro Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu 
460 465 470 

GAT ATT GGT ATC AAC AGC CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG 14 9 0 
Asp lie Gly lie Asn Ser His Gly Arg Asn Ala Tyr Thr Glu Gly Met 
475 480 485 490 

GTT TAT GGG CAC AAG TAT TTC AAA GAG ACA AAT TAC AAG AGG CTA GTA 15 3 8 
Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val 
495 500 505 

AGT GTG AAG ACT AAA GTT GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA 15 8 6 
Ser Val Lys Thr Lys Val Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin 
510 515 520 

AGC ATC CCA ACT TTG TCA TCT T GAAGAACGTA CATATATAAA TAAATACCTT 163 8 

Ser lie Pro Thr Leu Ser Ser 
525 

TGTGCATGGT ATTTTCAGGG TGTTAAAGTG ATATTCAGAT ATTTATGATA GAATTTTGAC 16 98 
TTGTATTTTA TACAATCAAA ATTGTATGGT TCTCCGAATT TCTCTTTTTA ATTCTGAAAA 17 58 
ATACATATTA GTATTGTCAA AAAAAA 17 84 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 9 amino acids 

(B) TYPE: amino acid 
( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Gin Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 

Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 
20 25 30 

His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 



Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 



50 



55 



60 



Leu Arg Phe Asn 
65 

Ala Glu His Val 



Asn Arg Leu Leu 
100 

Leu Ser Tyr Leu 
115 

Phe Asn Leu Arg 
130 

Val Gin Ala Gly 
145 

Lys Ser Asn Lys 



Val Gly Gly His 
180 

Tyr Gly Leu Ser 
195 

Asn Gly Lys Leu 
210 

Ala lie Thr Gly 
225 

Lys lie Lys Leu 



Glu Arg Arg Glu 
260 

Gin Val Ala Asp 
275 

Ser Val lie Asn 
290 

Pro Thr Leu Tyr 
305 

Lys Asp Phe Pro 



Ser Trp Val Glu 



Glu Thr Thr Thr 
70 

Ser His He Gin 
85 

Leu Lys Thr Arg 



Thr Asn Thr Asn 
120 

Ser He Asn Val 
135 

Ala Thr Leu Gly 
150 

His Gly Phe Pro 
165 

Phe Ser Gly Gly 



Val Asp Asn He 
200 

Leu Asp Arg Lys 
215 

Gly Gly Gly Val 
230 

Val Arg Val Pro 
245 

Glu Gin Asn Leu 



Lys Leu Asp Arg 
280 

Asp Thr Asn Gly 
295 

Leu Gly Asn Ser 
310 

Glu Leu Gly Leu 
325 

Ser Val Leu Tyr 



Pro Lys Pro Phe 
75 

Ala Ala Val Val 
90 

Ser Gly Gly His 
105 

Gin Pro Phe Phe 



Asp He Glu Gin 
140 

Glu Val Tyr Tyr 
155 

Ala Gly Val Cys 
170 

Gly Tyr Gly Asn 
185 

Val Asp Ala Gin 



Ser Met Gly Glu 
220 

Ser Phe Gly Val 
235 

Glu Val Val Thr 
250 

Ser Thr He Ala 
265 

Asp Leu Phe Leu 



Gly Lys Thr Val 
300 

Arg Asn Leu Val 
315 

Gin Glu Ser Asp 
330 

Tyr Thr Gly Phe 



Leu He He Thr 
80 

Cys Gly Lys Gin 
95 

Asp Tyr Glu Gly 
110 

He Val Asp Met 
125 

Glu Thr Ala Trp 



Arg He Ala Glu 
160 

Pro Thr Val Gly 
175 

Leu Met Arg Lys 
190 

He He Asp Val 
205 

Asp Leu Phe Trp 



Val Leu Ala Tyr 
240 

Val Phe Thr He 
255 

Glu Arg Trp Val 
270 

Arg Met Thr Phe 
285 

Arg Ala He Phe 



Thr Leu Leu Asn 
320 

Cys Thr Glu Met 
335 

Pro Ser Gly Thr 



340 



345 



350 



Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe 
355 360 365 

Lys lie Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
370 375 380 

Glu Phe lie Phe Glu Arg Leu Lys Glu Leu Glu Asn Gin Met Leu Ala 
385 390 395 400 

Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu lie Ser Glu Phe Ala Lys 
405 410 415 

Pro Phe Pro His Arg Ser Gly Asn lie Ala Lys lie Gin Tyr Glu Val 
420 425 430 

Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe 
435 440 445 

Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro 
450 455 460 

Arg Lys Ala Phe Leu Asn Tyr Arg Asp Leu Asp lie Gly lie Asn Ser 
465 470 475 480 

His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr 
485 490 495 

Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val 
500 505 510 

Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin Ser lie Pro Thr Leu Ser 
515 520 525 

Ser 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CCGCCATGGA GACTTCCATT CTTACTC 



(2) INFORMATION FOR SEQ ID NO : 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCGGATCCT CAAGATGACA AAGTTGGGAT GCT 



(2) INFORMATION FOR SEQ ID NO : 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helianthus annuus 

(B) STRAIN: Zebulon 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1590 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAG ACT TCC ATT CTT ACT CTC CTT CTT CTC TTG CTC TCA ACC CAA 
Met Glu Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 



TCT TCT GCA ACT TCC CGT TCC ATT ACA GAT CGC TTC ATT CAA TGT TTA 
Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 



20 



25 



30 



CAC GAC CGG GCC GAC CCT TCA TTT CCG ATA ACC GGA GAG GTT TAC ACT 14 4 
His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 

CCC GGA AAC TCA TCT TTT CCT ACC GTC TTG CAA AAC TAC ATC CGA AAC 192 
Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 
50 55 60 

CTT CGG TTC AAT GAA ACT ACC ACA CCA AAA CCC TTT TTA ATC ATC ACA 24 0 
Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr 
65 70 75 80 

GCC GAA CAT GTT TCC CAC ATT CAG GCA GCT GTG GTT TGT GGC AAA CAA 28 8 
Ala Glu His Val Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin 
85 90 95 

AAC CGG TTG CTA CTG AAA ACC AGA AGC GGT GGT CAT GAT TAT GAA GGT 33 6 
Asn Arg Leu Leu Leu Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly 
100 105 110 

CTT TCC TAC CTT ACA AAC ACA AAC CAA CCC TTC TTC ATT GTG GAC ATG 3 84 
Leu Ser Tyr Leu Thr Asn Thr Asn Gin Pro Phe Phe lie Val Asp Met 
115 120 125 

TTC AAT TTA AGG TCC ATA AAC GTA GAT ATC GAA CAA GAA ACC GCA TGG 4 32 
Phe Asn Leu Arg Ser He Asn Val Asp He Glu Gin Glu Thr Ala Trp 
130 135 140 

GTC CAA GCC GGT GCG ACT CTT GGT GAA GTG TAC TAT CGA ATA GCG GAG 4 80 
Val Gin Ala Gly Ala Thr Leu Gly Glu Val Tyr Tyr Arg He Ala Glu 
145 150 155 160 

AAA AGT AAC AAG CAT GGT TTT CCG GCA GGG GTT TGT CCA ACG GTT GGC 52 8 
Lys Ser Asn Lys His Gly Phe Pro Ala Gly Val Cys Pro Thr Val Gly 
165 170 175 

GTT GGT GGG CAT TTT AGT GGT GGT GGG TAT GGT AAT TTG ATG AGA AAA 5 76 
Val Gly Gly His Phe Ser Gly Gly Gly Tyr Gly Asn Leu Met Arg Lys 
180 185 190 

TAT GGT TTG TCG GTT GAT AAT ATT GTT GAT GCT CAA ATA ATA GAT GTG 624 
Tyr Gly Leu Ser Val Asp Asn He Val Asp Ala Gin He He Asp Val 
195 200 205 

AAT GGC AAG CTT TTG GAT CGA AAG AGT ATG GGT GAG GAT TTG TTT TGG 6 72 
Asn Gly Lys Leu Leu Asp Arg Lys Ser Met Gly Glu Asp Leu Phe Trp 
210 215 220 

GCG ATC ACC GGC GGT GGT GGT GTT AGT TTT GGT GTG GTT CTA GCC TAC 72 0 
Ala He Thr Gly Gly Gly Gly Val Ser Phe Gly Val Val Leu Ala Tyr 
225 230 235 240 



AAA ATC AAA CTA GTT CGT GTT CCG GAG GTT GTG ACC GTG TTT ACC ATT 7 68 
Lys lie Lys Leu Val Arg Val Pro Glu Val Val Thr Val Phe Thr lie 
245 250 255 

GAA AGA AGA GAG GAA CAA AAC CTC AGC ACC ATC GCG GAA CGA TGG GTA 816 
Glu Arg Arg Glu Glu Gin Asn Leu Ser Thr lie Ala Glu Arg Trp Val 
260 265 270 

CAA GTT GCT GAT AAG CTA GAT AGA GAT CTT TTC CTT CGA ATG ACC TTT 864 
Gin Val Ala Asp Lys Leu Asp Arg Asp Leu Phe Leu Arg Met Thr Phe 
275 280 285 

AGT GTC ATA AAC GAT ACC AAC GGT GGA AAG ACA GTC CGT GCT ATC TTT 912 
Ser Val lie Asn Asp Thr Asn Gly Gly Lys Thr Val Arg Ala lie Phe 
290 295 300 

CCA ACG TTG TAC CTT GGA AAC TCG AGG AAT CTT GTT ACA CTT TTG AAT 96 0 
Pro Thr Leu Tyr Leu Gly Asn Ser Arg Asn Leu Val Thr Leu Leu Asn 
305 310 315 320 

AAA GAT TTC CCC GAG TTA GGG TTG CAA GAA TCG GAT TGT ACT GAA ATG 10 0 8 
Lys Asp Phe Pro Glu Leu Gly Leu Gin Glu Ser Asp Cys Thr Glu Met 
325 330 335 

AGT TGG GTT GAG TCT GTG CTT TAC TAC ACG GGC TTC CCC AGT GGT ACT 1056 
Ser Trp Val Glu Ser Val Leu Tyr Tyr Thr Gly Phe Pro Ser Gly Thr 
340 345 350 

CCA ACC ACG GCG CTC TTA AGC CGT ACT CCT CAA AGA CTC AAC CCA TTC 1104 
Pro Thr Thr Ala Leu Leu Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe 
355 360 365 

AAG ATC AAA TCC GAT TAT GTG CAA AAT CCT ATT TCT AAA CGA CAG TTC 1152 
Lys lie Lys Ser Asp Tyr Val Gin Asn Pro lie Ser Lys Arg Gin Phe 
370 375 380 

GAG TTC ATC TTC GAA AGG ATG AAA GAA CTT GAA AAC CAA ATG TTG GCG 12 0 0 
Glu Phe lie Phe Glu Arg Met Lys Glu Leu Glu Asn Gin Met Leu Ala 
385 390 395 400 

TTC AAC CCA TAT GGT GGT AGA ATG AGT GAA ATA TCC GAA TTC GCA AAG 12 4 8 
Phe Asn Pro Tyr Gly Gly Arg Met Ser Glu lie Ser Glu Phe Ala Lys 
405 410 415 

CCT TTC CCA CAT AGA TCG GGT AAC ATA GCG AAG ATT CAA TAC GAA GTA 12 96 
Pro Phe Pro His Arg Ser Gly Asn lie Ala Lys lie Gin Tyr Glu Val 
420 425 430 

AAC TGG GAG GAT CTT AGC GAT GAA GCC GAA AAT CGT TAC TTG AAT TTC 13 44 
Asn Trp Glu Asp Leu Ser Asp Glu Ala Glu Asn Arg Tyr Leu Asn Phe 
435 440 445 

ACA AGG CTG ATG TAT GAT TAC ATG ACT CCA TTT GTG TCG AAA AAC CCT 13 92 



Thr Arg Leu Met Tyr Asp Tyr Met Thr Pro Phe Val Ser Lys Asn Pro 
450 455 460 



AGA GAA GCA TTT TTG AAC TAT AGG GAT TTG GAT ATT GGT ATC AAC AGC 144 0 
Arg Glu Ala Phe Leu Asn Tyr Arg Asp Leu Asp lie Gly lie Asn Ser 
465 470 475 480 

CAT GGC AGG AAT GCT TAT ACT GAA GGA ATG GTT TAT GGG CAC AAA TAT 14 8 8 
His Gly Arg Asn Ala Tyr Thr Glu Gly Met Val Tyr Gly His Lys Tyr 
485 490 495 

TTC AAA GAG ACA AAT TAG AAG AGG CTA GTA AGT GTG AAG ACT AAA GTT 153 6 
Phe Lys Glu Thr Asn Tyr Lys Arg Leu Val Ser Val Lys Thr Lys Val 
500 505 510 

GAT CCT GAC AAC TTC TTT AGG AAT GAG CAA AGC ATC CCA ACT TTG TCA 15 84 
Asp Pro Asp Asn Phe Phe Arg Asn Glu Gin Ser lie Pro Thr Leu Ser 
515 520 525 

TCT TG 15 8 9 

Ser 

530 



(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 20: 

Met Glu Thr Ser lie Leu Thr Leu Leu Leu Leu Leu Leu Ser Thr Gin 
15 10 15 

Ser Ser Ala Thr Ser Arg Ser lie Thr Asp Arg Phe lie Gin Cys Leu 
20 25 30 

His Asp Arg Ala Asp Pro Ser Phe Pro lie Thr Gly Glu Val Tyr Thr 
35 40 45 

Pro Gly Asn Ser Ser Phe Pro Thr Val Leu Gin Asn Tyr lie Arg Asn 
50 55 60 

Leu Arg Phe Asn Glu Thr Thr Thr Pro Lys Pro Phe Leu lie lie Thr 
65 70 75 80 



Ala Glu His Val Ser His lie Gin Ala Ala Val Val Cys Gly Lys Gin 
85 90 95 



Asn Arg Leu Leu Leu 
100 

Leu Ser Tyr Leu Thr 
115 

Phe Asn Leu Arg Ser 
130 

Val Gin Ala Gly Ala 
145 

Lys Ser Asn Lys His 
165 

Val Gly Gly His Phe 
180 

Tyr Gly Leu Ser Val 
195 

Asn Gly Lys Leu Leu 
210 

Ala lie Thr Gly Gly 
225 

Lys lie Lys Leu Val 
245 

Glu Arg Arg Glu Glu 
260 

Gin Val Ala Asp Lys 
275 

Ser Val lie Asn Asp 
290 

Pro Thr Leu Tyr Leu 
305 

Lys Asp Phe Pro Glu 
325 

Ser Trp Val Glu Ser 
340 

Pro Thr Thr Ala Leu 
355 

Lys lie Lys Ser Asp 
370 



Lys Thr Arg Ser Gly Gly 
105 

Asn Thr Asn Gin Pro Phe 
120 

lie Asn Val Asp lie Glu 
135 

Thr Leu Gly Glu Val Tyr 
150 155 

Gly Phe Pro Ala Gly Val 
170 

Ser Gly Gly Gly Tyr Gly 
185 

Asp Asn lie Val Asp Ala 
200 

Asp Arg Lys Ser Met Gly 
215 

Gly Gly Val Ser Phe Gly 
230 235 

Arg Val Pro Glu Val Val 
250 

Gin Asn Leu Ser Thr lie 
265 

Leu Asp Arg Asp Leu Phe 
280 

Thr Asn Gly Gly Lys Thr 
295 

Gly Asn Ser Arg Asn Leu 
310 315 

Leu Gly Leu Gin Glu Ser 
330 

Val Leu Tyr Tyr Thr Gly 
345 

Leu Ser Arg Thr Pro Gin 
360 

Tyr Val Gin Asn Pro lie 
375 



His Asp Tyr Glu Gly 
110 

Phe lie Val Asp Met 
125 

Gin Glu Thr Ala Trp 
140 

Tyr Arg lie Ala Glu 
160 

Cys Pro Thr Val Gly 
175 

Asn Leu Met Arg Lys 
190 

Gin lie lie Asp Val 
205 

Glu Asp Leu Phe Trp 
220 

Val Val Leu Ala Tyr 
240 

Thr Val Phe Thr lie 
255 

Ala Glu Arg Trp Val 
270 

Leu Arg Met Thr Phe 
285 

Val Arg Ala lie Phe 
300 

Val Thr Leu Leu Asn 
320 

Asp Cys Thr Glu Met 
335 

Phe Pro Ser Gly Thr 
350 

Arg Leu Asn Pro Phe 
365 

Ser Lys Arg Gin Phe 
380 



Glu Phe lie Phe 
385 

Phe Asn Pro Tyr 



Pro Phe Pro His 
420 

Asn Trp Glu Asp 
435 



Thr Arg Leu Met 
450 

Arg Glu Ala Phe 
465 

His Gly Arg Asn 



Phe Lys Glu Thr 
500 

Asp Pro Asp Asn 
515 

Ser 



Glu Arg Met Lys 
390 

Gly Gly Arg Met 
405 

Arg Ser Gly Asn 



Leu Ser Asp Glu 
440 



Tyr Asp Tyr Met 
455 

Leu Asn Tyr Arg 
470 

Ala Tyr Thr Glu 
485 

Asn Tyr Lys Arg 



Phe Phe Arg Asn 
520 



Glu Leu Glu Asn 
395 

Ser Glu lie Ser 
410 

lie Ala Lys lie 
425 

Ala Glu Asn Arg 



Thr Pro Phe Val 
460 

Asp Leu Asp lie 
475 

Gly Met Val Tyr 
490 

Leu Val Ser Val 
505 

Glu Gin Ser lie 



Gin Met Leu Ala 
400 

Glu Phe Ala Lys 
415 

Gin Tyr Glu Val 
430 

Tyr Leu Asn Phe 
445 



Ser Lys Asn Pro 



Gly He Asn Ser 
480 

Gly His Lys Tyr 
495 

Lys Thr Lys Val 
510 

Pro Thr Leu Ser 
525 



(2) INFORMATION FOR SEQ ID NO : 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: 2 . . 350 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 21: 



GAGAAACTCG GAGACTTTCA CACAATGCCT AACCTCAAAC TCCGACCCCA AACATCCCAT 60 

CTCCCCCGCT ATCTTCTTCT CCGGAAATGG CTCCTACTCC TCCGTATTAC AAGCCAACAT 12 0 

CCGTAACCTC CGCTTCAACA CCACCTCAAC TCCGAAACCC TTCCTCATAA TCGCCGCAAC 18 0 

ACATGAATCC CATGTGCAAG CCGCGATTAC TTGCGGGAAA CGCCACAACC TTCAGATGAA 24 0 

AATCAGAAGT GGAGGCCACG AC TACG ATGG CTTGTCATAC GTTACATACT CTGGCAAACC 3 00 

GTTCTTCGTC CTCGACATGT TTAACCTCCG TTCGGTGGAT GTCGACGTGG 3 50 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 278 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGCATGGATC TCCGCCGGAG CGACTCTCGG AGAGGTTTAT TATCGGATTT GGGAGAAAAG 6 0 

CAGAGTCCAT GGATTCCCCG CCGGAGTTTG ACCGACGGTT GGTGTTGGTG GGCATTTAAG 12 0 

CGGCGGTGGT TACGGTAACA TGGTGAGGAA GTTTGGATTA TCTGTGGATT ACGTTGAGGA 180 

TGCCAAGATC GTCGATGTAA ACNGTCGGGT TTTAGATCGG . AAAGCAATGG GTGAGGATCT 24 0 

GTTCTGGGCG ATTACCGGTG GAGGAGGAGG TAGCGTAC 278 



(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TGGACATATT AGCGGAGGAG GATTCGGTAC AATAATGAGG AAATACGGTT TAGCGTCTGA 60 

TAACGTTGTG GACGCACGTT TGATGGATGT AAATGGGAAA ACTCTTGACC GGAAAACGAT 12 0 

GGGAGAGGAT TTGTTTTGGG CGCTTAGAGG CGGTGGAGCT GCGAGTTTTG GCGTTGTCTT 18 0 

GTCGTGGAAG GTTAAGCTTG CTAGGGTTCC TGAAAAGGTA ACTTGTTTCA TAAGTCAACA 24 0 

TCCGATGGGA CCTAGCATGA ACAAGCTTGT TCATAGATGG CAATCCATAG GATCAAGANN 3 00 

GCTAGACGAA GATTTATTCA TCAGAGTCAA TATTGACAAC AGTCT 34 5 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNE S S : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GTTCGTTAAA ACCTATCCTN NANGGGCNAA AGNATATCAA AGNTTGNTTA NGNAACC CAA 6 0 
NATTTCTGAA CTGGCCNCCT TCGGTGGTAT ATGNCNAAAN CCCTTGAATC TGCGNANCCN 12 0 
ATTCCGCATA GAAACGGAAC CCTCTTCAAG ATTCTCTATT TACNCGAACT GNCTAGANNG 18 0 
AATGACAAGA CATCGAGTAG NAAAATCAAC TGGATCAAAG AGATATACAA TTACATGGCG 24 0 
CCTTATGTCT CAAGCAATCC AAGACAAGCA TATGTGAACT ACAGAGATCT AGACTTCGGA 3 00 
CAGAACAAGA ACAACGCAAA GGTTAACTTC ATTGAAGCTA AAATCTGGGG ACCTAAGTAC 3 60 
TTCAAAGGCA ATTTTGACAG ATTGGTGAAG ATTAAAACCA AGGTTGATCC AGAGAACTTC 42 0 
TTCAGGCACG AGCAGAGTAT CCCACCTATG CCCTACTAGA AGCTAGGTTC ATGAAACCAA 4 80 
TAACATTATC AAAAATAAGR ATAAATGRTA ATTGTATACA ACATGATTCG KCTTTCTTTA 5 40 
TTTCAGACAA TGTGGACACT ACTCTAAANT AAAAWGTCNA TTTACCTTAA AAAAAAAATA 6 00 
ATCCCCNNTA ANANAAAANT GGGGGGGCCN TTTTTGGGGN TCCCGGTTTT NGGACGGGGN 6 60 
GCTTTNGGGG GGCTTGGNNT TTTTTTNGGN GCCCC 6 95 

(2) INFORMATION FOR SEQ ID NO : 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
4 (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . .4 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 25: 

TCTGTTTTNA GGCAGAGCAG AGGAAGTTGT TGCTTTGCTT GGTAAGGAGT TTCCTGAATT 60 

NAGTTTAAAG AAGGAGAACT GTTCGGAGAT G AC TTGGTTT CAGTCAGCTT TATGGTGGGA 120 

TAATCGTGTT AACCCTACTC ANATTGATCC WAAAGTGTTT CTCGATCGGA ATCTTGATAG 18 0 



AGCGAATTTC GGAAAGAGGA AATCGGATTA 
TAAGYCTTTT TCCAAGARGA TGMCTGACCT 
TGGTGGGAAA ATGGCGGAGG TTACGGTTAA 
TTTTAAGATT CAGTACTCGG TGACTTNGCA 
TCTTGAATCA GGCTAACGTC CTTATAGGTT 
AATNCTTACT TMAAT 



CGTTGCGAGT AAGATTCCTA GAGATGGGAT 24 0 
GGGGAAAAYC GGGCTTGTTT TTAAWCCGTA 3 00 
CGCGACGCCG TTTCCNCACC GAAGCAAGCT 36 0 
AGAAAACTCT NTCGAGATAG AGAAAGGGTT 42 0 
CATGACCGGG TTTTTNAGCA AGANCCCTGG 4 80 

495 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .2 04 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AAATTAAAAC AAATCAATGT TGATATTGAA TCCAATAGTG CTTGGTTTCA ACCTGGTGCT 60 

ACGCTTGGTG AGCTTTACTA C AG AATTN C A GAGAAGAGCA AAATCCATGG ATTTCCNGCG 12 0 

GGTTTNTNCA CAAGCNTAGG CATAGGTGGG TATATNANAG GCGGTGGATA CGGTACCTTG 180 

ATGAGGAAGT ATGGTCTTNC GGGA 2 04 



(2) INFORMATION FOR SEQ ID NO : 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: CDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2.. 491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 27: 

GAGATTTCTC GAGCAAGATA CTCCACTGAT GATCTTTGAG CCATTGGGTG GGAAAATCAG 60 

CAAGATTTCA GAAACAGAAT CTCCATATCC ACACAGAAGA GGTAATCTGT ATAATATACA 12 0 

GTACATGGTG AAATGGAAAG TGAATGANGT CGAGGAGATG AACAAACATG TCAGGTGGAT 180 

GAGATCGTTA CACGATTACA TGACTCCGTA TGTTTCTAAA TCGCCGAGAG GAGCTTATTT 24 0 

GANTTACAGA GATCTTGATT TGGGCTCGAC CAAAGGGATT AACACGGGTT TCGGAGATGC 3 00 

AAGGAAATGG NNGGGTGAGN CTTTTTTCAA AGGTAATTTC CAAGGGGTTA GGTTTTGGTT 3 60 

AAAGGGGAGG TTTNNCCCAN CAAATTTTTT TTCAGGANCC GGCCANGNTT TTCCCCCCCC 42 0 

TNTTTTTNGG NCCCCAATCN AAANCCCCGT TTTAAAAGGG GGGCCATTTC NTTTTTTNCA 4 80 

NNTTAAAAGG G 4 91 



(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 



(A) NAME/ KEY : CDS 

(B) LOCATION: 3 . .407 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 28: 

ATTTGTTCGT GAGGTTAACT TTGACTTTAG TCAACGGTAC GAAGCCTGGT GAGAATACGG 60 

TTTTAGCGAC TTTCATTGGG ATGTATTTAG GCCGGTCGGA TAAGCTGTTG ACCGTNATGA 12 0 

ACCGGGATTT CCCGGAGTTG AAGC TGAAGA AAACCGATTN TACCGAGATG AGATGGATCG 18 0 

ATTCGGTTCT GTTTTGGGAC GATTATCCGG TTGGTACACC GACTTCTGTG CTACTAAATC 24 0 

CGCTAGTCGC AAAAAAGTTG TTCATGAAAC GAAAATCGGA CTACGTGAAG CGTCTNATTT 3 00 

TCGAGAACCC GATCTCNNGT TTGATACTCA AG AAATTTG T AG AGG TTNNG AAAGTTAAAA 3 60 

TNAATTTGGA TCCGCATTNN GGNANNNATG GTGAAACCCC NNGTTNT 4 07 

(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 3 . .3 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 29: 
ACGGCGTCGT ATTGGCCTAC AAAATAAACC TTGTTGAAGT CCCAGAAAAC GTCACCGTTT 60 
TCAGAATCTC CCGGACGTTA GAACAAAATG CGACGGATAT CATTCACCGG TGGCAACAAG 12 0 
TTGCAC CGAA GCTTCCCGAC GAGCTTTTCA TAAGANCAGT CATTGACGTA NAAACGGCAC 18 0 
TGTTTCATNN CTCAAAAGAC CGTCAGACAA CATTCATAG C AATGTTTCTA GGAGACACGN 24 0 



CAACTCTACT GTCGATATTA AAC C GG AG AT TCCCAGAATT GGGTTTGGTC CGGTC TGACT 3 00 



GTACCGNAAC AAGCNNTTGG ATCCAATCTG TGCTATTTTT GGGACAAATA TCCCAGGTTG 3 60 



(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 3.. 427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCTTCACTGT CACCAAAACG TTAGAACAAG ACGCAAGATT GAAGACTATT TCTAAGTGGC 60 
AACAAATTTC ATCCAAGATT ATTGAAGAGA TACACATCCG AGTGGTACTC AGAGCAGCTG 12 0 



GAAATGATGG AAACAAGACT GTGACAATGA 

GCACCTTGCT GAAGGTTATG GAGAAGG CTT 

GTACTGAAAT GAGCTGGATT GAAGCCGCCC 

CTATTGAAAT TTTGCTTMAG CTCAAGTCGC 

CGGATTTCGT TAAAGAACCT WTTCCTGTGA 
TTGAAGG 



CCTACCTAGG TCAGTTTCTT GGCGAGAAAG 180 
TTCCAGAACT AGGGTTAACT CAAAAGGATT 24 0 
TTTTC CATGG TGGRTTTCCA ACAGGKTCTC 300 
CTYTAGGAAA AGRTTWCTTC AAAGCAACGK 3 60 
TAGGGCTCAA AGGAATATTC AAAAGATTGA 42 0 

427 



(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) 



HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 
GTTGTACTAT CATNGAAGAT TAAGTTAGTC GATGTTCCGT CCACGGTCAC CGNGTTTAAA 60 
GTCCAGAAAC ATNAGGAGAA AGAGGCCGTT AGGNTCATCA ACAAGTGGCA GTATGTTGCG 12 0 
GATAAGGTCC CTGAAGATCT TTTCATCAGC GCAACGTTGG NGAGATCAAA CGGAAACTCT 18 0 
GTGCAGGCTT TGTTTACTGG ACTCTATCTT GGNC CGGTGA ATAATNTCTT GGCCTTGATG 24 0 
GAAGAAAAGT TTCCAGANTT AGGTCTTGAT ATCCAAGNCT GCACAGAGAT GAGTTGGGCT 30 0 
GAATCTGCAC TCTGGTNTNC TGNTTTCNCT AAAGGAGAGN CTCCTTGGGT GTTCCNCGCG 3 60 
GATCGGNAGC GGNCAATTTN TGGNCTTTCA AGGGGAAAGN CGGCTTTTTN CAAGAACCCG 42 0 
NTACCCGGGG TTCAATT 43 7 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . .441 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GCGGACCCTA TAGATCANNA TGTGCTACTG ANAGAAGAGG AAGCCAAGAA CAAGCCGGAG 6 0 



1 



ACAGATAAAT ATCTGAAATG GGNCGATANC GTTTACGAAT TTATGACNCC ATATGTTTCG 12 0 

AAATCTCCAA GAGGAGCTTA TGTCAATTTC AAGGATATGG ATTTGGGTAT GTATCTTGGA 180 

AAGAAGAAGA CAAAGTACGA GGAAGGAAAG AGTTGGGGAG TGAAGTATTT CAAGAACAAT 24 0 

TTCGAGAGAT TGGTGAGAGT GAAGACTAGG GTTGATCCAA CAGATTTCTT CTGCGATGAA 3 00 

CAGAGCATTC CTCTGGTGAA CAAAGTTACC TGAAGATATC ATTTGAAGTT TTTTATTAGT 3 60 

CCCTTTTCTC TGTGAAATCA TCTGTGCGTG TTGAATATTA TGCGTCAAGT GTGTAACTTA 420 

TGTGTGTGAT TGTGAATTGT G 441 

(2) INFORMATION FOR SEQ ID NO : 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2.. 502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CTGGCTTAAC ACAACGTCGT TTTGGGCCAA TTACCCGGCG GGTAC AC C C A AGAGCATCCT 6 0 

TCTAGATAGG CCTCCGACGA ATTCAGTGTC ATTTAAGAGT AAATCGGATT TTGTCAAAAA 12 0 

AC C AATAC CC AAAAAAGGTT TAGAGAAGCT TTGGAAGACA ATGTTTAAAT TCAACAGTAG 18 0 

CGTCTCGTTG CAATTCAACC CTTACGGTGG AGTGATGGAC CGGATTCCGG CAACGGCCAC 24 0 

CGCTTTTCCT CATCGGAAAG GAAACTTGTT CAAGGTTCAA TACNCTACGA TGTGGTTTGA 3 00 

CGCAAACGCC ACACAGAGTA GCCNGGCTAT GATGAATGAG CTTTTTGAGG TGGCGGGACC 3 60 

GTACGTGNGT CAAGTAAACC CGAGANANGG CTTCCTTTAA NTT C AG AGNC CATCGNTNTT 42 0 



NGGAGCAANN CCAAGTGGGG GGGNCCAACC GGGGGNTNAA ANCNNAGNTC TTNGGGGGCC 4 80 



CAGAATTTCC TTNGGGGAAT TT 



502 



(2) INFORMATION FOR SEQ ID NO : 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2 . .4 00 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

NGGGAATTGC NCGAGGNAAG TTGTACCCAA TTCCTGGACC ACCATTGGTT TCCCAAGAAN 6 0 

CCCGAGACAA CCGTTTTTCA ATNACCGTGA TGTTGATTTG GGTATTAATT CTCATAATGG 12 0 

TAAAATCAGT AGTTATGTGG AAGGTAAACG TTACGGGAAG AAGTATTTCG CAGGTAATTT 180 

CGAGAGATTG GTGAAGATTA AGACGAGAGT TGATAGTGGT AATTTCTTTA GGAACGAACA 24 0 

GAGTATTCCT GTGTTACCAT AAGTGTATTT ATTTGATTAT TGGTTAGTGA AATTTGTTGT 3 00 

TGTATAATGA TTATATGTCG TATTTTTATT TATTATTAGT AAT T TAT AAA GTTTGATATT 3 60 

AAATACAAAT AGTATAATAA GATAGTTTCT TTTAGTAAAA 4 00 



(2) INFORMATION FOR SEQ ID NO : 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2 . . 383 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CAACTCTAAT GGGAACACCT AC TTCG ATCG AATGT CGATG GGGGAAGAGC TTTTCTGGGC 60 
GGTTCGAGGA GGTGGAGCCG CGAGTTTCGG CATCGTGATG GGATACAAAA TCCGGTTGGT 12 0 
TCCGGTTCCG GAGAAAGTTA CGGTTTTTAG CGTCGGAAAA ACCGTCGGAG AAGGAGCCGT 18 0 
TGATCTTATA ATGAAGTGGC AGAACTTCTC TCATAGTACG GNTCGGAATT TNTTTGTGAA 24 0 
GCTGANTTTT GANTTTAGTC AACGGTGCAA AGCCGGGTGA AAAAAAGGTT TTAGNGNCTT 3 00 
TCANTTTGGN TGNAANCTTG G GGGTTTT AT NAGAACGGTT AACCGGGATT NANCCCGNGT 360 
TTTCCCGGGG TTAAAACCTT NGG 3 83 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1 . .3 54 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 36: 
ATCAATGTTC TTACTAAACG TACACGAGCA TCGTTGGCTT TCAAGGCTAA ATCTGATTTT 6 0 
NTTCAAGAAC CGATNCCTAA AACCGCGATT TCGAAGCTTT GGAGACGGTT GCAAGAACCG 12 0 



GAAGCAGAGC ATGCTCAGCT AATTTNCACN CCATTTGGTG GTAAAATGAG TNAGATTGCA 180 



GATTACGAAA CACCATTTCC GCATAGGAAG GGGAATATAT ATNAGATTCA GTACTTGAAT 24 0 
T AC TGG AG AG GAGACGTGAA AGAGAAGTAT ATTGAGATNG GTGGAGGAGA GTTTACGGTT 3 00 
GNTATNAGTA AGTTTTTTGG CGAAGTNTNC CNAGAGGNGN CTTNNTNTAA ACCT 3 54 



(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : doub 1 e 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidposis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37: 

TTTTTTAGTA CACTAATAAT CAAATGGAAT GAGAAATGAA GC C AC AAAAG TATCTGCAAT 6 0 

CAAAATATCC TGCTATCTCC ATCTCAAGCT CTCAATAGTA TCCTCTCCGA AAGTGAAATC 12 0 

AACATTTCAA ACTCTATTTC TTGGTGGAAT CGATAGACTG ATTCCTCTGA TGAACCAGAA 180 

GTTTCCGGAA CTCGGCTTAC GATCTCAAGA CTGTTCGGAA ATGAGCTGGA TCGAATCGAT 24 0 

AATGTTCTTC AACTGGAGAT CAGGACAGCC GTTAGAGATT TTGCTCAACA GAGAC CTAAG 3 00 

GATTCGAGGA TCAGTATTTC AAAGCAAAGT CAGGATTATG GTTCAAAAAC CCGTTCCTGA 360 

AAACGTTTTT CGAAGAGGTA TCCAAGGGGT TTCTCGAGCA AGT 403 



(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : double 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAGATGAGTT GGATTAANTC TGTACTCTGG TTTGCTGATT TCCCTAAAGG AGAATCTCTT 60 
NGTGTTCTCA CGAATCGTAA GCGTACATCT CTATCTTTNA AAGGCAAAGA TGATTTTATC 12 0 
CAAGAACCGA TACCCGAGGC TGCAATTNAA GAGATATGGA GGCGATTAGA AGCCCCCNAG 180 
GCTCGGCTTG GAAAGATCAT ATTAACTCCA TTTGGTGGGA AAATNAGTGA AATGGCAGAG 240 
TACGTANCAC CATTCCCACA 2 60 



(2) INFORMATION FOR SEQ ID NO : 3 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2 . . 605 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 
CTCTTGCATA TTCGCTGCAA GGATGGGAAA TTCAAAACCA CTCCCTACAA TTTTTTGTAT 60 



TATAGTTTCA GTCTTGTATT TTTAATTCTA TTGCATAACA CCAACTTCTT CATCAGCCTC 12 0 
CAT C C AAG AT CAATTCATAA ACTGTGTCAA AAGAAACACA CATGTTTCTT TTCCACTCGA 18 0 
GAAAACGTTA TTCACCCCTG CGAAAAACGT CTCTTTGTTC AACCAAGTCC TTGANTCGAC 24 0 
GGCTCAAAAT CTCCAGTTCT TGGCAAAATC CATGCCTAAA CCGGGRTTCA TATTCAGACC 3 00 
GATTCACCAG TCTCAAGTCC AAGSTTCCAT CATTTGTTCA AMGRAACTCG GGNTTCATTT 3 60 
TNGTGTTTGA NGTGGCGGTC ACGATTTTCG AGGCCTTTGT NTTTATGTTT CACGGTTTGA 42 0 
AAAAACCGTT TATATTACTC GGCCTGTCAA ANTTGNANNC AAAATCANAT GTTGGATATT 4 80 
GNATTC CAAA TAGGTNCTTG GGGTNAACCT GGTGGCTANC GTTTGGTGAG CTTTTACTTT 54 0 
CAAGAATTTG CANGNGGANG TGCAAAGATT CCATGGGATT TCCCGGGGGG TTTNTTGCAC 6 00 
AATGT 605 

(2) INFORMATION FOR SEQ ID NO : 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 2.. 464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

AACACAAAAC TCTTCCATTT GGCTTCTCTC TTG CATATTC GTTGCAAGGA TGGGAAATTC 60 

AAAACCACTC CCTACAATTN CTTGTATTAT CGTTTCAGTC TTGTATTTTN NATTCTATTG 12 0 

CATAACACCA ACTTCTTCAT CAGCCTCCAT CCAAGNTCAA TTCATAAACT GTGTCAAAAG 18 0 

GAACACACAT GTTTCTTTTC CACTCGAGNA AACGGTATTC ACTCCTGCGG AAAACGGCTC 24 0 

TNTTATTCAA CGGGTCCNTG AATCGACGGG TCAAAATCTC CAGTTCTTGG NAAAATCCAT 30 0 



GNCTAAACCG GGGTTCATAT TCAGGCCGGT TCACCAGTCT CAAGTCCAAG NTTCCATCAT 3 60 



TTGTTCAAAG GAACTCGGGA TTCATTTCCG CGNTAGAAGT GGCGGGCANN GGTTTCGGGG 42 0 
CCTGTCTNTT GNTTANGGGN AGGAAAACCG GTTNTATTNC TCGG 4 64 



(2) INFORMATION FOR SEQ ID NO : 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1 . .3 86 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 41: 

TCGGGAGCCC ANGNTAAATT ANNTGAAAAT GGGGNCGNAT ANC CGTTTAC NGAATTTTAT 60 

GACNCCCAAT ATGTTTCGAA ATCTCAAAGA NNGGGANCTT ATGTCAATTT CAAGGATATG 12 0 

GATTTGGGTA TGTATCTTGG AAAGNAGAAG ACAAAGTACG AGGAAGGAAA GAGTTGGGGA 18 0 

GTGAAGTATT TCAAGAACAA TTTCGAGAGA TTGGTGAGAG TGAAGACTAG GGTTGATCCN 240 

ACAGATTTCN TCTGCGATGA ACAGAGCATT CCTCTGGTGN ACAAAGTTAC CTGAAGATAT 3 00 

CATTTGAAGT TTTTTATTAG TCCCTTTTCT CTGTGAAATC ATCTGTGCGT GTTGAATANT 3 60 

ATGCGTCAAG TGTGTAACTT ATGTGT 3 86 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 77 base pairs 

(B) TYPE: nucleic acid 

( C ) S TRANDEDNE S S : doub 1 e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1 . . 377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TACCATAGGG AGGTGGTGNA AGATTTTGTA TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 60 
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 12 0 
TTCATTGAAG CTAAAATCTG GGGACCTAAG TACTTCAAAG GCAATTTTGA CAGATTGGTG 180 
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC TTCTTCAGGC ACGAGCAGAG TATCCCACCT 24 0 
ATGCCCTACT AGAAGCTAGG TTCATGAAAC CAATAACATT ATCAAAAATA AGAATAAATG 3 00 
ATAATTGTAT ACAACATGAT TCGTCTTTCT TTATTTCAGA CAATGTGGAC ACTACTCTAA 3 60 
ATAAAATGTC ATTTACC 377 



(2) INFORMATION FOR SEQ ID NO : 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



TACCATAGGG AGGTGGTGNA AGATTTTGTA 
TGGTGGTGGG GAGCTGTAAA CGTATGGTGG 
TTCATTGAAG CTAAAATCTG GGGACCTAAG 
AAGATTAAAA CCAAGGTTGA TCCAGAGAAC 
ATGCCCTACT AGAAGCTAGG TTCATGAAAC 
ATAATTGTAT ACAACATGAT TCGTCTTTCT 
ATAAAATGTC ATTTACC 



TGTAGNCTTA GGGGAAGGCG AGTAGTATGG 6 0 

TGGTGGAGAT TTGTATGTGG GCTGGTTAAC 12 0 

TACTTCAAAG GCAATTTTGA CAGATTGGTG 180 

TTCTTCAGGC ACGAGCAGAG TATCCCACCT 240 

CAATAACATT ATCAAAAATA AGAATAAATG 3 00 

TTATTTCAGA CAATGTGGAC ACTACTCTAA 3 60 

377 



(2) INFORMATION FOR SEQ ID NO : 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 346 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GAGCTGTGGA TATGGTCACA AATGGCAATC GGTTGGTCCG AAAACTGATC CGAATCTTTT 6 0 
TATGAGAATN TTGATTCAAC CAGTGACGAG GAAGAAGGTA AAGACTGTGA GAGCTTC TNT 12 0 
GGTTGCCCTN TTTTNAGGCN AGACAGATGA AGTTTTTGCT TTCCTTAGTA AGGAGTTTCC 180 
TGAATTGGGT TTAAAGAAGG AGAATTNTTC GGAGATGACT TGGTTTCANT CTGCTTTATG 24 0 
GTGGGACAAT CGTCTTAATG CTACTCAGGT TGATCCTAAA GTNTTTCTTG ATCGGAATCT 3 00 
CGATACCTCG AGTTTCGGTA AGAGGAAATC GGATTACGTC GCGACT 34 6 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 2.. 261 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

ATGGGGTGAG ACTTATTTCA AAGGTAATTT CAAGAGATTA GGTTTGGTTA AAGGGAAGNT 60 

TGATCCAACA AATTTCTTCA GGAACGAACA GAGTATTCCT CCTCTGTTTT GAGTCCTCAA 12 0 

TACAAAACCA GATATAAAAG ATGTCATTTC ATTTTTTCAA T T AT AAT AG A TAATGTAACT 18 0 

TTCTGCTACA ATTGTAAAAG TGAGATGTAC CCAATACGGT TTAAGCGGAC CGAGAATAGT 24 0 

CAATTCAAAG ACCAAATTCT G 261 



(2) INFORMATION FOR SEQ ID NO : 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..478 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 46: 
GCTCAAAGGA CTAACCATGA AAACTTCCTC AAGTGTCTCT CTCACCGANT CAACGAGGAC 60 
GACTCAAGAN TTATACACAC ATCAAAAGAT CCTTCGTATT TNTCAATCTT GATTTCTTCC 12 0 
ATACAAAATC CAAGTTTCTC TGTTCTCGAA ACACCTAAAC CGGTTTCAAT CATCACTCCG 180 
GTTCAAGCCA CCGATGTTCA ATCTACGNTT AAATNCGCAC GGNCTTCACG GGTATACACA 24 0 
ATCAGGGCTA GGAGTGGTNG TCATGACTAC GGAGGTTTAT CTTTACATTG GCTTAAAAAN 3 00 
CANNCCGTTC GTTNNTCATT GATTTNNAGA AATCTTCCGG GCTTATTTAA CATNTAAGAT 3 60 
GTTTGATAAN CCGGNNCCNG TTTGGGGTTC AAATCCCGGT GGCTTACAAA NTTNGGGGGA 42 0 
ATTGTNCCTA TGAGGTTTGG AAAATTAANG CAAAATNTTT TGGGCCTTCC CGGCCGGT 478 

(2) INFORMATION FOR SEQ ID NO : 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: ecotype Columbia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2.. 579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

GGCCGTTAGG ATCATCAAGA AATGGCAATA TGCTGCAGAT AAGGTTCCTG ATGATCTTTT 60 

CATTAGGACA ACATTGGAGA GATCAAACAA GAACGCAGTA CACGCTTTGT TCACTGGACT 12 0 

ATATATTGGT CCGGTGAACA ATCTATTGGC GTTGATGGAA GAAAAGTTTC CGGAACTAGG 180 

TCTTGAGAAA GAAGGTTGTG AAGAGATGAG TTGGATTGAG TCTGTACTCT GGTTTGCTGA 24 0 

TTTCCCTAAA GGAGAATCTC TTGGTGTTCT CACGAATCGT GAG CGT AC AT CTCTATCTTT 300 

C AAAGG C AAA GATGATTTTG TCCAAGAACC GATACCCGAG GCTGCAATTC AAGAGATATG 360 



GAGGCGATTA GAAGCCCCCG AGGCTCGGCT TGGAAAGATC ATATTAACTC CATTTGGGTG 42 0 



NGGNAAAATG AGTGAAATGG CAGAGNCCGA ACCACCAATT CCCACANNCG AGGGAGGGGA 4 80 
ACCCCTNTGN GGNTCAGAAT GTGGTTCCTG GNNNNNAAGN GGGNGCCAGN ACCAANCCGG 54 0 
GNCNGTAAAN CNTGNAATGG GCCNAACCCG TNCCGGATT 57 9 



(2) INFORMATION FOR SEQ ID NO : 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
<iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Oryza sativa 

(B) STRAIN: Nipponbare, subsp . japonica 

(D) DEVELOPMENTAL STAGE: etiolated shoot (8 days old) 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 3 . .252 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 48: 

TGTCCTGGAA GGTCCGCCTC GTGCAGGTTN CGACGACGGT GACGGTGTTC GTCGTCGGGA 6 0 

GGAACGTCGA CCAGGGCGCC GCNGACGTCG TCGCCAGATG GCAAGACGTC GCGCCGAGCC 12 0 

TCCCTCCCGA GCTCACCATA CGGGTGATCG TNCGAGGGCA GCGCGCCACG TTCCAGTCGC 18 0 

TGTACCTCGG CTCGTGCGCC GACCTGGTGC CGACGATGAG CAGCATGTTC CCGGAGCTCG 24 0 
GGATGACGAT TG 2 52 



(2) INFORMATION FOR SEQ ID NO : 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- s i te 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels Ambiguous 
/note= "Xaa = Cys or Ser" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 0.. 21 

(D) OTHER INFORMATION: /label= ambiguous 

/note= "Xaa-Xaa probably is Ser-Phe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Thr Ser Thr Ser lie lie Asp Arg Phe Thr Gin Xaa Leu Asn Asn Arg 
15 10 15 

Ala Asp Pro Xaa Xaa 
20 



(2) INFORMATION FOR SEQ ID NO : 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied- s ite 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= unknown 



(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Ser" 

(ix) FEATURE : 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Trp" 

(ix) FEATURE: 

(A) NAME/ KEY : Modified- site 

(B) LOCATION: 24 

(D) OTHER INFORMATION: /label= ambiguous 
/note= "Xaa = probably Tyr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50: 

Xaa lie Xaa Val Xaa lie Glu Asp Glu Thr Ala Xaa Val Gin Ala Gly 
15 10 15 

Ala Thr Leu Gly Glu Val Tyr Xaa 
20 



(2) INFORMATION FOR SEQ ID NO : 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 51: 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gin Leu Tyr Tyr Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO : 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



(C) 
(D) 



STRANDEDNESS : single 
TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTTCTACTT CTATTATTGA TAGGTTTACT CA 3 2 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 

(B) STRAIN: lollo bionda 

(ix) FEATURE: 

(A) NAME/ ICE Y : CDS 

(B) LOCATION: 1..405 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ACT TCT ACT TCT ATT ATT 
Thr Ser Thr Ser lie lie 
1 5 

GCT GAC CCT TCT TTC CCG 
Ala Asp Pro Ser Phe Pro 
20 

TCC TCT TTT CCA TCC GTC 
Ser Ser Phe Pro Ser Val 
35 

AAT GAA TCC ACG ACT CCC 
Asn Glu Ser Thr Thr Pro 
50 

CCT TCA CAC ATT CAA GCA 
Pro Ser His lie Gin Ala 



GAT AGG TTT ACT CAA TGT CTA AAC AAC CGA 4 8 

Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg 
10 15 

CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC 96 
Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn 
25 30 

TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC 144 
Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe 
40 45 

AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC 192 
Lys Pro lie Leu lie lie Thr Ala Leu His 
55 60 

GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG 24 0 
Ala Val Val Cys Ala Lys Thr His Arg Leu 



65 



70 



75 



80 



CTA ATG AAA ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT 2 88 
Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr 
85 90 95 

GTG ACC AAT TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA 336 
Val Thr Asn Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu 
100 105 110 

CGC TCC ATA AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCC 3 84 
Arg Ser lie Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala 
115 120 125 

GGC GCC ACC CTC GGA GAA GTT 4 05 

Gly Ala Thr Leu Gly Glu Val 
130 135 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Thr Ser Thr Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg 
15 10 15 

Ala Asp Pro Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn 
20 25 30 

Ser Ser Phe Pro Ser Val Leu Gin Ala Tyr He Arg Asn Leu Arg Phe 
35 40 45 

Asn Glu Ser Thr Thr Pro Lys Pro He Leu He He Thr Ala Leu His 
50 55 60 

Pro Ser His He Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu 
65 70 75 80 

Leu Met Lys Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr 
85 90 95 

Val Thr Asn Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu 
100 105 110 

Arg Ser He Asn Val Ser He Glu Asp Glu Thr Ala Trp Val Gin Ala 
115 120 125 



Gly Ala Thr Leu Gly Glu Val 
130 135 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACGTTTATG GAGCGTAAGT TGAAC 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACCCTTCAC ACATTCAAGC AGC 



(2) INFORMATION FOR SEQ ID NO : 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactuca sativa 



(B) STRAIN: 



lollo bionda 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 7.. 1626 

(ix) FEATURE: 

(A) NAME /KEY : unsure 

(B) LOCATION: replace (372, "g") 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: replace (379, "g") 

(ix) FEATURE: 

(A) NAME /KEY : unsure 

(B) LOCATION: replace (786, "t") 

(ix) FEATURE: 

(A) NAME /KEY : unsure 

(B) LOCATION: replace ( 1 10 5 . . 110 6 , "ga") 

(D) OTHER INFORMATION: /note= "also possible "gg" and "aa" " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ACAAAA ATG GCA ATT ACC TAT TCT TTC AAC TTC AAA TCT TAT ATT TTT 4 8 

Met Ala lie Thr Tyr Ser Phe Asn Phe Lys Ser Tyr lie Phe 
1 5 10 

CCT CTC CTC CTT GTC TTG CTC TCT ACC CAT TCA TCA GCG ACT TCA ACT 96 
Pro Leu Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr 
15 20 25 30 

TCC ATT ATA GAT CGC TTC ACC CAA TGT CTA AAC AAC CGA GCT GAC CCT 144 
Ser lie lie Asp Arg Phe Thr Gin Cys Leu Asn Asn Arg Ala Asp Pro 
35 40 45 

TCT TTC CCG CTC AGT GGA CAA CTT TAC ACT CCC GAT AAC TCC TCT TTT 192 
Ser Phe Pro Leu Ser Gly Gin Leu Tyr Thr Pro Asp Asn Ser Ser Phe 
50 55 60 

CCA TCC GTC TTG CAA GCT TAC ATC CGG AAC CTC CGA TTC AAT GAA TCC 24 0 
Pro Ser Val Leu Gin Ala Tyr lie Arg Asn Leu Arg Phe Asn Glu Ser 
65 70 75 

ACG ACT CCC AAA CCC ATC TTA ATC ATC ACC GCC TTA CAC CCT TCA CAC 2 88 
Thr Thr Pro Lys Pro lie Leu lie lie Thr Ala Leu His Pro Ser His 
80 85 90 

ATT CAA GCA GCT GTT GTG TGC GCC AAA ACA CAC CGC CTG CTA ATG AAA 336 
lie Gin Ala Ala Val Val Cys Ala Lys Thr His Arg Leu Leu Met Lys 
95 100 105 110 

ACC AGA AGC GGA GGC CAT GAT TAT GAG GGG CTT TCC TAT GTG ACC AAT 3 84 



Thr Arg Ser Gly Gly His Asp Tyr Glu Gly Leu Ser Tyr Val Thr Asn 
115 120 125 



TCG AAC CAA CCC TTT TTT GTT GTT GAC ATG TTC AAC TTA CGC TCC ATA 432 
Ser Asn Gin Pro Phe Phe Val Val Asp Met Phe Asn Leu Arg Ser lie 
130 135 140 



AAC GTG AGT ATT GAA GAT GAA ACT GCA TGG GTC CAA GCT GGT GCG ACT 480 
Asn Val Ser lie Glu Asp Glu Thr Ala Trp Val Gin Ala Gly Ala Thr 
145 150 155 

CTT GGT GAA GTC TAC TAC CGA ATA GCA GAG AAA AGC AAC AGT CAT GCT 52 8 
Leu Gly Glu Val Tyr Tyr Arg lie Ala Glu Lys Ser Asn Ser His Ala 
160 165 170 

TTT CCG GCT GGC GTT TGC CCT ACT GTT GGA GTT GGT GGC CAT TTT AGT 57 6 
Phe Pro Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser 
175 180 185 190 



GGT GGT GGT TAT GGT AAC TTG ATG GGA AAA TAC GGC CTT TCT GTT GAC 624 

Gly Gly Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp 

195 200 205 

AAT ATT GTC GAT GCT CAG TTA ATC GAT GTG AAT GGT AAA CTT CTG AAT 672 

Asn lie Val Asp Ala Gin Leu lie Asp Val Asn Gly Lys Leu Leu Asn 
210 215 220 



CGG AAA TCA ATG GGT GAA GAT CTT TTT TGG GCC ATC ACA GGT GGT GGT 72 0 
Arg Lys Ser Met Gly Glu Asp Leu Phe Trp Ala lie Thr Gly Gly Gly 
225 230 235 



GGT GTC AGC TTT GGT GTG GTT GTA GCG TAC AAG ATC AAA CTG GTT CGT 76 8 
Gly Val Ser Phe Gly Val Val Val Ala Tyr Lys lie Lys Leu Val Arg 
240 245 250 



GTT CCT ACC ACT GTG ACC GTT TTT AAC GTA CAA AGA ACA TCC GAG CAG 816 
Val Pro Thr Thr Val Thr Val Phe Asn Val Gin Arg Thr Ser Glu Gin 
255 260 265 270 



AAC CTA AGC ACC ATA GCC CAC CGA TGG ATA CAA GTT GCG GAT AAG CTC 864 
Asn Leu Ser Thr He Ala His Arg Trp He Gin Val Ala Asp Lys Leu 
275 280 285 

GAT AAT GAC CTT TTC CTT CGA ATG ACC TTT AAC GTG ATA AAC AAC ACA 912 
Asp Asn Asp Leu Phe Leu Arg Met Thr Phe Asn Val He Asn Asn Thr 
290 295 300 



AAT GGC GAA AAG ACG ATA CGT GGT TTG TTT CCA ACA CTG TAC CTC GGA 960 
Asn Gly Glu Lys Thr He Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly 
305 310 315 



AAC TCT ACC GCT CTT GTT GCC CTC CTG AAC AAG GAT TTC CCT GAA TTA 1008 
Asn Ser Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu 
320 325 330 



GGT GTA GAA ATT TCA GAT TGT ATT GAA ATG AGT TGG ATC GAG TCT GTT 105 6 
Gly Val Glu lie Ser Asp Cys lie Glu Met Ser Trp lie Glu Ser Val 
335 340 345 350 



CTT TTC TAC ACA AAC TTC CCC ATT GGT ACT CCG ACC ACT GCT CTT CTA 1104 
Leu Phe Tyr Thr Asn Phe Pro lie Gly Thr Pro Thr Thr Ala Leu Leu 
355 360 365 

AGC CGT ACA CCT CAA AGA CTA AAC CCA TTC AAA ATC AAA TCT GAT TAC 1152 
Ser Arg Thr Pro Gin Arg Leu Asn Pro Phe Lys lie Lys Ser Asp Tyr 
370 375 380 

GTA AAA AAC ACT ATT TCC AAA CAG GGA TTC GAA TCC ATA TTT GAA AGG 12 00 
Val Lys Asn Thr lie Ser Lys Gin Gly Phe Glu Ser lie Phe Glu Arg 
385 390 395 

ATG AAA GAA CTC GAA AAC CAA ATG CTA GCT TTC AAC CCT TAT GGT GGA 124 8 
Met Lys Glu Leu Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly 
400 405 410 

AGA ATG AGC GAA ATT TCC GAA TTT GCA AAG CCT TTT CCC CAT CGA TCA 12 96 
Arg Met Ser Glu lie Ser -Glu Phe Ala Lys Pro Phe Pro His Arg Ser 
415 420 425 430 

GGG AAT ATA GCG AAG ATC CAA TAC GAA GTA AAC TGG GAT GAA CTT GGC 1344 
Gly Asn lie Ala Lys lie Gin Tyr Glu Val Asn Trp Asp Glu Leu Gly 
435 440 445 

GTT GAA GCA GCC AAT CGG TAC TTG AAC TTC ACA AGG GTG ATG TAT GAT 13 92 
Val Glu Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp 
450 455 460 

TAT ATG ACT CCG TTT GTT TCT AAG AAC CCC AGG GAA GCA TTT CTG AAC 144 0 
Tyr Met Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn 
465 470 475 

TAC AGG GAT TTA GAT ATT GGT GTC AAC AGT CAT GGC AAG AAT GCT TAC 14 8 8 
Tyr Arg Asp Leu Asp lie Gly Val Asn Ser His Gly Lys Asn Ala Tyr 
480 485 490 

GGT GAA GGA ATG GTT TAT GGG CAC AAG TAT TTC AAA GAG ACG AAT TAT 153 6 
Gly Glu Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr 
495 500 505 510 

AAG AGG CTA ACG ATG GTG AAG ACG AGG GTT GAT CCT AGC AAT TTT TTT 15 84 
Lys Arg Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe 
515 520 525 

AGG AAT GAG CAA AGT ATC CCA ACT TTG TCA TCT TCA TGG AAG 162 6 

Arg Asn Glu Gin Ser lie Pro Thr Leu Ser Ser Ser Trp Lys 
530 535 540 



TAAATTCTAA ATTCACTTGT GAAATTGAAT AAAAGTATGG CTTTTTCAAG GTCATGGTAT 16 8 6 



CCAGATTCAG ATGATATTGA TATAATTTTG ACTTGTATTT ATACAAACAA AATTATATTA 174 6 
TATTTTTCTG AATTTAGATT TTCCATTCTT TGGAAAAATA TACGAACATT GATGTTGATA 18 06 
TTTTTAAGAA TTATAGATTT TGAACATTGT GAACAATGAA TAAACCGAGG ACTTCCCTTG 18 6 6 
GGTTTTTTTT ATAAGTATGT AATAGCATGT CTTTAATCAA GATAACCGAT CATTGGATGC 192 6 
AATTTATTAT TATAAACCTT ATTTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 1981 

(2) INFORMATION FOR SEQ ID NO : 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala lie Thr Tyr Ser Phe Asn Phe Lys Ser Tyr lie Phe Pro Leu 
1 5 10 15 

Leu Leu Val Leu Leu Ser Thr His Ser Ser Ala Thr Ser Thr Ser lie 
20 25 30 



lie Asp Arg Phe 
35 

Pro Leu Ser Gly 
50 

Val Leu Gin Ala 
65 

Pro Lys Pro lie 



Ala Ala Val Val 
100 

Ser Gly Gly His 
115 

Gin Pro Phe Phe 
130 

Ser lie Glu Asp 
145 

Glu Val Tyr Tyr 



Thr Gin Cys Leu 
40 

Gin Leu Tyr Thr 
55 

Tyr lie" Arg Asn 
70 

Leu lie lie Thr 
85 

Cys Ala Lys Thr 



Asp Tyr Glu Gly 
120 

Val Val Asp Met 
135 

Glu Thr Ala Trp 
150 

Arg lie Ala Glu 
165 



Asn Asn Arg Ala 



Pro Asp Asn Ser 
60 

Leu Arg Phe Asn 
75 

Ala Leu His Pro 
90 

His Arg Leu Leu 
105 

Leu Ser Tyr Val 



Phe Asn Leu Arg 
140 

Val Gin Ala Gly 
155 

Lys Ser Asn Ser 
170 



Asp Pro Ser Phe 
45 

Ser Phe Pro Ser 



Glu Ser Thr Thr 
80 

Ser His lie Gin 
95 

Met Lys Thr Arg 
110 

Thr Asn Ser Asn 
125 

Ser lie Asn Val 



Ala Thr Leu Gly 
160 

His Ala Phe Pro 
175 



Ala Gly Val Cys Pro Thr Val Gly Val Gly Gly His Phe Ser Gly Gly 
180 185 190 



Gly Tyr Gly Asn Leu Met Gly Lys Tyr Gly Leu Ser Val Asp Asn He 
195 200 205 

Val Asp Ala Gin Leu He Asp Val Asn Gly Lys Leu Leu Asn Arg Lys 
210 215 220 

Ser Met Gly Glu Asp Leu Phe Trp Ala He Thr Gly Gly Gly Gly Val 
225 230 235 240 

Ser Phe Gly Val Val Val Ala Tyr Lys He Lys Leu Val Arg Val Pro 
245 250 255 

Thr Thr Val Thr Val Phe Asn Val Gin Arg Thr Ser Glu Gin Asn Leu 
260 265 270 

Ser Thr He Ala His Arg Trp He Gin Val Ala Asp Lys Leu Asp Asn 
275 280 285 

Asp Leu Phe Leu Arg Met Thr Phe Asn Val He Asn Asn Thr Asn Gly 
290 295 300 

Glu Lys Thr He Arg Gly Leu Phe Pro Thr Leu Tyr Leu Gly Asn Ser 
305 310 315 320 

Thr Ala Leu Val Ala Leu Leu Asn Lys Asp Phe Pro Glu Leu Gly Val 
325 330 335 

Glu He Ser Asp Cys He Glu Met Ser Trp He Glu Ser Val Leu Phe 
340 345 350 

Tyr Thr Asn Phe Pro He Gly Thr Pro Thr Thr Ala Leu Leu Ser Arg 
355 360 365 

Thr Pro Gin Arg Leu Asn Pro Phe Lys He Lys Ser Asp Tyr Val Lys 
370 375 380 

Asn Thr He Ser Lys Gin Gly Phe Glu Ser He Phe Glu Arg Met Lys 
385 390 395 400 

Glu Leu Glu Asn Gin Met Leu Ala Phe Asn Pro Tyr Gly Gly Arg Met 
405 410 415 

Ser Glu He Ser Glu Phe Ala Lys Pro Phe Pro His Arg Ser Gly Asn 
420 425 430 

He Ala Lys He Gin Tyr Glu Val Asn Trp Asp Glu Leu Gly Val Glu 
435 440 445 

Ala Ala Asn Arg Tyr Leu Asn Phe Thr Arg Val Met Tyr Asp Tyr Met 
450 455 460 



Thr Pro Phe Val Ser Lys Asn Pro Arg Glu Ala Phe Leu Asn Tyr Arg 
465 470 475 480 



Asp Leu Asp lie Gly Val Asn Ser His Gly Lys Asn Ala Tyr Gly Glu 
485 490 495 

Gly Met Val Tyr Gly His Lys Tyr Phe Lys Glu Thr Asn Tyr Lys Arg 
500 505 510 

Leu Thr Met Val Lys Thr Arg Val Asp Pro Ser Asn Phe Phe Arg Asn 
515 520 525 

Glu Gin Ser lie Pro Thr Leu Ser Ser Ser Trp Lys 
530 535 540 



(2) INFORMATION FOR SEQ ID NO : 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 59: 
GGTAATGATC TCCTTTCTTG TTTGACC 



(2) INFORMATION FOR SEQ ID NO : 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
AGAGCGGCCG C TATATTAC A ACTTCTCCAC CATCACTCCT C 



(2) INFORMATION FOR SEQ ID NO: 61: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
GGTGATGTTA ATGATAATCT CCTC 



(2) INFORMATION FOR SEQ ID NO : 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 
AGAGCGGCCG CTACAATTCC TTCAACATGT AAATTTCCTC 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 
ACTTCCCGTA GAAACTCGGA GACTTTCACA CAATGC 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TCCATCCAAG ATCAATTCAT AAACTGTGTC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGAGCGGCCG CTTTCATGAA CCTAGCTTCT AGTAGG 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AGAGCGGCCG CGAAATGGCC CCCCTTTTAA AACGGGG 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGAGCGGCCG CAAATGATAT CTTCAGGTAA CTTTGTTCAC 



(2) INFORMATION FOR SEQ ID NO : 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AGAGCGGCCG CATAATCAAA TAAATACACT TATGGTAACA CAG 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS': single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
AGAGCGGCCG CTGGTTTTGT ATTGAGGACT CAAAACAG 



(2) INFORMATION FOR SEQ ID NO : 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iii) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: j oin ( 1 . . 57 0 , 801.. 1754) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 4 8 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 
15 10 15 

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 
Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
20 25 30 

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 
Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 
50 55 60 

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 24 0 
Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 
65 70 75 80 

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 2 88 
Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 
85 90 95 

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 3 36 
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

CGT TCG GTG GAT GTC GAT GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 43 2 
Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
130 135 140 

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 4 80 
Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 
145 150 155 160 

CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 52 8 
His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 



165 170 175 

ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTT AAT 570 
Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn 
180 185 190 

GGTATAATTG ATATCTCTAT TTTATATACT AATTAAATTT TATAGTGTGG ATCGGATAGT 63 0 

GATTTTGGTC CATCAATTAA AAACTTGGTG AACATAAAAT TAAC CAAGCA ATCAATTTAG 690 

ACAAGCAACA T AAT CAT AT A TATTTTTCTT ACATTTGTAT GTACCTGAAT ATTTATATTT 75 0 

ATGTTTATAT GTTCTCACTA TATTTTCACT TTTGTATTTG AAAATTTTTA GGA AAA 806 

Gly Lys 

ATT TTG GAT AGA AAA TTG ATG GGA GAA GAT CTC TAC TGG GCA ATA AAC 854 
lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 
195 200 205 

GGA GGA GGA GGA GGG AGC TAC GGC GTC GTA TTG GCC TAC AAA ATA AAC 902 
Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 
210 215 220 

CTT GTT GAA GTC CCA GAA AAC GTC ACC GTT TTC AGA ATC TCC CGG ACG 95 0 

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr 
225 230 235 240 

TTA GAA CAA AAT GCG ACG GAT ATC ATT CAC CGG TGG CAA CAA GTT GCA 998 
Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 
245 250 255 

CCG AAG CTT CCC GAC GAG CTT TTC ATA AGA ACA GTC ATT GAC GTA GTA 104 6 
Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 
260 265 270 

AAC GGC ACT GTT TCA TCT CAA AAG ACC GTC AGG ACA ACA TTC ATA GCA 10 94 
Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 
275 280 285 

ATG TTT CTA GGA GAC ACG ACA ACT CTA CTG TCG ATA TTA AAC CGG AGA 114 2 
Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 
290 295 300 

TTC CCA GAA TTG GGT TTG GTC CGG TCT GAC TGT ACC GAA ACA AGC TGG 1190 
Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 

ATC CAA TCT GTG CTA TTC TGG ACA AAT ATC CAA GTT GGT TCG TCG GAG 12 3 8 
lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 

ACA CTT CTA CTC CAA AGG AAT CAA CCC GTG AAC TAC CTC AAG AGG AAA 12 8 6 
Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 



TCA GAT TAG GTA CGT GAA 'CCG ATT TCA AGA ACC GGT TTA GAG TCA ATT 
Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 



1334 



TGG AAG AAA ATG ATC GAG CTT GAA ATT CCG ACA ATG GCT TTC AAT CCA 13 82 
Trp Lys Lys Met lie Glu Leu Glu He Pro Thr Met Ala Phe Asn Pro 
370 375 380 

TAC GGT GGT GAG ATG GGG AGG ATA TCA TTA CGG GTG ACT CCG TTC CCA 14 3 0 
Tyr Gly Gly Glu Met Gly Arg He Ser Leu Arg Val Thr Pro Phe Pro 
385 390 395 400 

TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 14 7 8 
Tyr Arg Ala Gly Asn Leu Trp Lys He Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 152 6 
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 15 74 
Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

AAT AAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 1622 
Asn Asn Arg Asp Val Asp Leu Gly He Asn Ser His Asn Gly Lys He 
450 455 460 

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 167 0 
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 1718 
Asn Phe Glu Arg Leu Val Lys He Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 

TTC TTT AGG AAC GAA CAC AGT ATT CCT GTG TTA CCA TAA 17 57 

Phe Phe Arg Asn Glu His Ser He Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO : 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 



Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr 
15 10 



Ser Asn 
15 



Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
20 25 30 



Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 

Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie He Ala Ala Thr His 
50 55 60 

Glu Ser His Val Gin Ala Ala He Thr Cys Gly Lys Arg His Asn Leu 
65 70 75 80 



Gin Met Lys He Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 
85 90 95 

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

Gly Ala He Leu Gly Glu Val Tyr Tyr Tyr He Trp Glu Lys Ser Lys 
130 135 140 



Thr Leu Ala Tyr 
145 

His He Ser Gly 



Thr Val Asp Asn 
180 

He Leu Asp Arg 
195 

Gly Gly Gly Gly 
210 

Leu Val Glu Val 
225 

Leu Glu Gin Asn 



Pro Lys Leu Pro 
260 

Asn Gly Thr Val 
275 

Met Phe Leu Gly 
290 



Pro Ala Gly He 
150 

Gly Gly Tyr Gly 
165 

Thr He Asp Ala 



Lys Leu Met Gly 
200 

Gly Ser Tyr Gly 
215 

Pro Glu Asn Val 
230 

Ala Thr Asp He 
245 

Asp Glu Leu Phe 



Ser Ser Gin Lys 
280 

Asp Thr Thr Thr 
295 



Cys Pro Thr Val 
155 

Asn Met Met Arg 
170 

Arg Met Val Asp 
185 

Glu Asp Leu Tyr 



Val Val Leu Ala 
220 

Thr Val Phe Arg 
235 

He His Arg Trp 
250 

He Arg Thr Val 



Thr Val Arg Thr 



Leu Leu Ser He 
300 



Gly Val Gly Gly 
160 

Lys Tyr Gly Leu 
175 

Val Asn Gly Lys 
190 

Trp Ala He Asn 
205 

Tyr Lys He Asn 



He Ser Arg Thr 
240 

Gin Gin Val Ala 
255 

He Asp Val Val 
270 

Thr Phe He Ala 
285 

Leu Asn Arg Arg 



Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 



lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 

Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 

Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
370 375 380 

Tyr Gly Gly Glu Met Gly Arg lie Ser Leu Arg Val Thr Pro Phe Pro 
385 390 395 400 

Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

Asn Asn Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 

Phe Phe Arg Asn Glu His Ser lie Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN: Colombia 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

ACT TCC CGT AGA AAC TCG GAG ACT TTC ACA CAA TGC CTA ACC TCA AAC 4 8 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 
15 10 15 

TCC GAC CCC AAA CAT CCC ATC TCC CCC GCT ATC TTC TTC TCC GGA AAT 96 
Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
20 25 30 

GGC TCC TAC TCC TCC GTA TTA CAA GCC AAC ATC CGT AAC CTC CGC TTC 144 
Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 

AAC ACC ACC TCA ACT CCG AAA CCC TTC CTC ATA ATC GCC GCA ACA CAT 192 
Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 
50 55 60 

GAA TCC CAT GTG CAA GCC GCG ATT ACT TGC GGG AAA CGC CAC AAC CTT 24 0 
Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 
65 70 75 80 

CAG ATG AAA ATC AGA AGT GGA GGC CAC GAC TAC GAT GGC TTG TCA TAC 288 
Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 
85 90 95 

GTT ACA TAC TCT GGC AAA CCG TTC TTC GTC CTC GAC ATG TTT AAC CTC 336 
Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

CGT TCG GTG GAT GTC GAC GTG GCA AGT AAG ACC GCG TGG GTC CAA ACC 3 84 
Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

GGT GCC ATA CTC GGA GAA GTT TAT TAC TAT ATA TGG GAG AAG AGC AAA 43 2 
Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
130 135 140 

ACC CTA GCT TAT CCC GCC GGA ATT TGT CCC ACG GTT GGT GTC GGT GGC 4 80 
Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 
145 150 155 160 



CAT ATC AGT GGT GGA GGT TAC GGT AAC ATG ATG AGA AAA TAC GGT CTC 
His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 
165 170 175 



528 



ACC GTA GAT AAT ACC ATC GAT GCA AGA ATG GTC GAC GTA AAT GGA AAA 
Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn Gly Lys 
180 185 190 



576 



ATT TTG GAT AGA 
lie Leu Asp Arg 
195 

GGA GGA GGA GGA 
Gly Gly Gly Gly 
210 

CTT GTT GAA GTC 
Leu Val Glu Val 
225 

TTA GAA CAA AAT 
Leu Glu Gin Asn 



CCG AAG CTT CCC 
Pro Lys Leu Pro 
260 

AAC GGC ACT GTT 
Asn Gly Thr Val 
275 

ATG TTT CTA GGA 
Met Phe Leu Gly 
290 

TTC CCA GAA TTG 
Phe Pro Glu Leu 
305 

ATC CAA TCT GTG 
lie Gin Ser Val 



ACA CTT CTA CTC 
Thr Leu Leu Leu 
340 

TCA GAT TAC GTA 
Ser Asp Tyr Val 
355 

TGG AAG AAA ATG 
Trp Lys Lys Met 
370 

TAC GGT GGT GAG 
Tyr Gly Gly Glu 



AAA TTG ATG 

Lys Leu Met 

GGG AGC TAC 
Gly Ser Tyr 
215 

CCA GAA AAC 
Pro Glu Asn 
230 

GCG ACG GAT 
Ala Thr Asp 
245 

GAC GAG CTT 
Asp Glu Leu 

TCA TCT CAA 
Ser Ser Gin 



GAC ACG ACA 
Asp Thr Thr 
295 

GGT TTG GTC 
Gly Leu Val 
310 

CTA TTC TGG 
Leu Phe Trp 
325 

CAA AGG AAT 
Gin Arg Asn 



CGT GAA CCG 
Arg Glu Pro 



ATC GAG CTT 
lie Glu Leu 
375 

ATG GGG AGG 
Met Gly Arg 



GGA GAA GAT 
Gly Glu Asp 
200 

GGC GTC GTA 
Gly Val Val 



GTC ACC GTT 
Val Thr Val 



ATC ATT CAC 
lie lie His 
250 

TTC ATA AGA 
Phe lie Arg 
265 

AAG ACC GTC 
Lys Thr Val 
280 

ACT CTA CTG 
Thr Leu Leu 



CGG TCT GAC 
Arg Ser Asp 



ACA AAT ATC 
Thr Asn lie 
330 

CAA CCC GTG 
Gin Pro Val 
345 

ATT TCA AGA 
lie Ser Arg 
360 

GAA ATT CCG 
Glu lie Pro 



ATA TCA TCT 
lie Ser Ser 



CTC TAC TGG GCA 
Leu Tyr Trp Ala 
205 

TTG GCC TAC AAA 
Leu Ala Tyr Lys 
220 

TTC AGA ATC TCC 
Phe Arg lie Ser 
235 

CGG TGG CAA CAA 
Arg Trp Gin Gin 



ACA GTC ATT GAC 
Thr Val lie Asp 
270 

AGG ACA ACA TTC 
Arg Thr Thr Phe 
285 

TCG ATA TTA AAC 
Ser lie Leu Asn 
300 

TGT ACC GAA ACA 
Cys Thr Glu Thr 
315 

CAA GTT GGT TCG 
Gin Val Gly Ser 



AAC TAC CTC AAG 
Asn Tyr Leu Lys 
350 

ACC GGT TTA GAG 
Thr Gly Leu Glu 
365 

ACA ATG GCT TTC 
Thr Met Ala Phe 
380 

ACG GTG ACT CCG 
Thr Val Thr Pro 



ATA AAC 624 
lie Asn 



ATA AAC 67 2 
lie Asn 



CGG ACG 72 0 
Arg Thr 
240 

GTT GCA 76 8 

Val Ala 

255 

GTA GTA 816 
Val Val 



ATA GCA 8 64 
lie Ala 



CGG AGA 912 
Arg Arg 



AGC TGG 96 0 
Ser Trp 
320 

TCG GAG 1008 

Ser Glu 

335 

AGG AAA 1056 
Arg Lys 



TCA ATT 1104 
Ser lie 



AAT CCA 1152 
Asn Pro 



TTC CCA 12 0 0 
Phe Pro 



385 



390 



395 



400 



TAC AGA GCC GGT AAT CTC TGG AAG ATT CAG TAC GGT GCG AAT TGG AGA 124 8 
Tyr Arg Ala Gly Asn Leu Trp Lys lie Gin Tyr Gly Ala Asn Trp Arg 
405 410 415 

GAT GAG ACT TTA ACC GAC CGG TAC ATG GAA TTG ACG AGG AAG TTG TAC 12 96 
Asp Glu Thr Leu Thr Asp Arg Tyr Met Glu Leu Thr Arg Lys Leu Tyr 
420 425 430 

CAA TTC ATG ACA CCA TTT GTT TCC AAG AAT CCG AGA CAA TCG TTT TTC 13 44 
Gin Phe Met Thr Pro Phe Val Ser Lys Asn Pro Arg Gin Ser Phe Phe 
435 440 445 

AAT TAC CGT GAT GTT GAT TTG GGT ATT AAT TCT CAT AAT GGT AAA ATC 13 92 
Asn Tyr Arg Asp Val Asp Leu Gly lie Asn Ser His Asn Gly Lys lie 
450 455 460 

AGT AGT TAT GTG GAA GGT AAA CGT TAC GGG AAG AAG TAT TTC GCA GGT 144 0 
Ser Ser Tyr Val Glu Gly Lys Arg Tyr Gly Lys Lys Tyr Phe Ala Gly 
465 470 475 480 

AAT TTC GAG AGA TTG GTG AAG ATT AAG ACG AGA GTT GAT AGT GGT AAT 14 8 8 
Asn Phe Glu Arg Leu Val Lys lie Lys Thr Arg Val Asp Ser Gly Asn 
485 490 495 

TTC TTT AGG AAC GAA CAG AGT ATT CCT GTG TTA CCA TAA 152 7 

Phe Phe Arg Asn Glu Gin Ser lie Pro Val Leu Pro 
500 505 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Arg Arg Asn Ser Glu Thr Phe Thr Gin Cys Leu Thr Ser Asn 
15 10 15 

Ser Asp Pro Lys His Pro lie Ser Pro Ala lie Phe Phe Ser Gly Asn 
20 25 30 

Gly Ser Tyr Ser Ser Val Leu Gin Ala Asn lie Arg Asn Leu Arg Phe 
35 40 45 



Asn Thr Thr Ser Thr Pro Lys Pro Phe Leu lie lie Ala Ala Thr His 
50 55 60 



Glu Ser His Val Gin Ala Ala lie Thr Cys Gly Lys Arg His Asn Leu 
65 70 75 80 



Gin Met Lys lie Arg Ser Gly Gly His Asp Tyr Asp Gly Leu Ser Tyr 
85 90 95 

Val Thr Tyr Ser Gly Lys Pro Phe Phe Val Leu Asp Met Phe Asn Leu 
100 105 110 

Arg Ser Val Asp Val Asp Val Ala Ser Lys Thr Ala Trp Val Gin Thr 
115 120 125 

Gly Ala lie Leu Gly Glu Val Tyr Tyr Tyr lie Trp Glu Lys Ser Lys 
130 135 140 

Thr Leu Ala Tyr Pro Ala Gly lie Cys Pro Thr Val Gly Val Gly Gly 
145 150 155 160 

His lie Ser Gly Gly Gly Tyr Gly Asn Met Met Arg Lys Tyr Gly Leu 
165 170 175 

Thr Val Asp Asn Thr lie Asp Ala Arg Met Val Asp Val Asn Gly Lys 
180 185 190 

lie Leu Asp Arg Lys Leu Met Gly Glu Asp Leu Tyr Trp Ala lie Asn 
195 200 205 

Gly Gly Gly Gly Gly Ser Tyr Gly Val Val Leu Ala Tyr Lys lie Asn 
210 215 220 

Leu Val Glu Val Pro Glu Asn Val Thr Val Phe Arg lie Ser Arg Thr 
225 230 235 240 

Leu Glu Gin Asn Ala Thr Asp lie lie His Arg Trp Gin Gin Val Ala 
245 250 255 

Pro Lys Leu Pro Asp Glu Leu Phe lie Arg Thr Val lie Asp Val Val 
260 265 270 

Asn Gly Thr Val Ser Ser Gin Lys Thr Val Arg Thr Thr Phe lie Ala 
275 280 285 

Met Phe Leu Gly Asp Thr Thr Thr Leu Leu Ser lie Leu Asn Arg Arg 
290 295 300 

Phe Pro Glu Leu Gly Leu Val Arg Ser Asp Cys Thr Glu Thr Ser Trp 
305 310 315 320 

lie Gin Ser Val Leu Phe Trp Thr Asn lie Gin Val Gly Ser Ser Glu 
325 330 335 



Thr Leu Leu Leu Gin Arg Asn Gin Pro Val Asn Tyr Leu Lys Arg Lys 
340 345 350 



Ser Asp Tyr Val Arg Glu Pro lie Ser Arg Thr Gly Leu Glu Ser lie 
355 360 365 

Trp Lys Lys Met lie Glu Leu Glu lie Pro Thr Met Ala Phe Asn Pro 
370 375 380 



Tyr Gly Gly Glu 
385 

Tyr Arg Ala Gly 



Asp Glu Thr Leu 
420 

Gin Phe Met Thr 
435 

Asn Tyr Arg Asp 
450 

Ser Ser Tyr Val 
465 

Asn Phe Glu Arg 



Phe Phe Arg Asn 
500 



Met Gly Arg lie 
390 

Asn Leu Trp Lys 
405 

Thr Asp Arg Tyr 



Pro Phe Val Ser 
440 

Val Asp Leu Gly 
455 

Glu Gly Lys Arg 
470 

Leu Val Lys lie 
485 

Glu Gin Ser lie 



Ser Ser Thr Val 
395 

lie Gin Tyr Gly 
410 

Met Glu Leu Thr 
425 

Lys Asn Pro Arg 



lie Asn Ser His 
460 

Tyr Gly Lys Lys 
475 

Lys Thr Arg Val 
490 

Pro Val Leu Pro 
505 



Thr Pro Phe Pro 
400 

Ala Asn Trp Arg 
415 

Arg Lys Leu Tyr 
430 

Gin Ser Phe Phe 
445 

Asn Gly Lys lie 



Tyr Phe Ala Gly 
480 

Asp Ser Gly Asn 
495 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(B) STRAIN : Colombia 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1527 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



TCC ATC CAA GAT CAA TTC ATA AAC TGT GTC AAA AGA AAC ACA CAT GTT 
Ser lie Gin Asp Gin Phe lie Asn Cys Val Lys Arg Asn Thr His Val 
15 10 15 



TCT TTT CCA CTC GAG AAA ACG TTA TTC ACC CCT GCG AAA AAC GTC TCT 
Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser 
20 25 30 



TTG TTC AAC CAA GTC CTT GAA TCG ACG GCT CAA AAT CTC CAG TTC TTG 
Leu Phe Asn Gin Val Leu Glu Ser Thr Ala Gin Asn Leu Gin Phe Leu 
35 40 45 



GCA AAA TCC ATG CCT AAA CCG GGA TTC ATA TTC AGA CCG ATT CAC CAG 
Ala Lys Ser Met Pro Lys Pro Gly Phe lie Phe Arg Pro lie His Gin 
50 55 60 



TCT CAA GTC CAA GCT TCC ATC ATT TGT TCA AAG AAA CTC GGA ATT CAT 
Ser Gin Val Gin Ala Ser lie lie Cys Ser Lys Lys Leu Gly lie His 
65 70 75 80 



TTT CGT GTT AGA AGT GGC GGT CAC GAT TTC GAG GCC TTG TCT TAT GTT 
Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val 
85 90 95 



TCA CGG ATT GAA AAA CCG TTT ATA TTA CTC GAC CTG TCA AAA TTG AAA 
Ser Arg lie Glu Lys Pro Phe lie Leu Leu Asp Leu Ser Lys Leu Lys 
100 105 110 



CAA ATC AAT GTT GAT ATT GAA TCC AAT AGT GCT TGG GTT CAA CCT GGT 
Gin lie Asn Val Asp lie Glu Ser Asn Ser Ala Trp Val Gin Pro Gly 
115 120 125 

GCT ACG CTT GGT GAG CTT TAC TAC AGA ATT GCA GAG AAG AGC AAG ATC 
Ala Thr Leu Gly Glu Leu Tyr Tyr Arg lie Ala Glu Lys Ser Lys lie 
130 135 140 



CAT GGA TTT CCC GCG GGT TTG TGC ACA AGT GTA GGC ATA GGT GGG TAT 
His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly lie Gly Gly Tyr 
145 150 155 160 



ATG ACA GGC GGT GGA TAC GGT ACC TTG ATG AGG AAG TAT GGT CTT GCG 
Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala 
165 170 175 



GGA GAT AAT GTT CTA GAC GTA AAG ATG GTT GAT GCA AAT GGT AAA TTA 
Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu 
180 185 190 



CTC GAC AGA GCC GCG ATG GGT GAG GAC CTA TTT TGG GCG ATT AGA GGA 
Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala lie Arg Gly 
195 200 205 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



GGC GGT GGA GCG AGT TTC GGG ATA GTT CTA GCA TGG AAG ATC AAG CTT 672 
Gly Gly Gly Ala Ser Phe Gly lie Val Leu Ala Trp Lys lie Lys Leu 
210 215 220 

GTT CCT GTT CCT AAG ACT GTT ACC GTC TTC ACT GTC ACC AAA ACG TTA 72 0 
Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu 
225 230 235* 240 

GAA CAA GAC GCA AGA TTG AAG ACT ATT TCT AAG TGG CAA CAA ATT TCA 768 
Glu Gin Asp Ala Arg Leu Lys Thr lie Ser Lys Trp Gin Gin lie Ser 
245 250 255 



TCC AAG ATT ATT GAA GAG ATA CAC ATC CGA GTG GTA CTC AGA GCA GCT 816 
Ser Lys lie lie Glu Glu He His He Arg Val Val Leu Arg Ala Ala 
260 265 270 

GGA AAT GAT GGA AAC AAG ACT GTG ACA ATG ACC TAC CTA GGT CAG TTT 864 
Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gin Phe 
275 280 285 

CTT GGC GAG AAA GGC ACC TTG CTG AAG GTT ATG GAG AAG GCT TTT CCA 912 
Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro 
290 295 300 

GAA CTA _ GGG TTA ACT CAA AAG GAT TGT ACT GAA ATG AGC TGG ATT GAA 960 
Glu Leu Gly Leu Thr Gin Lys Asp Cys Thr Glu Met Ser Trp He Glu 
305 310 315 320 

GCC GCC CTT TTC CAT GGT GGA TTT CCA ACA GGT TCT CCT ATT GAA ATT 10 0 8 
Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro He Glu He 
325 330 335 

TTG CTT CAG CTC AAG TCG CCT CTA GGA AAA GAT TAC TTC AAA GCA ACG 105 6 
Leu Leu Gin Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr 
340 345 350 

TCG GAT TTC GTT AAA GAA CCT ATT CCT GTG ATA GGC TTC AAA GGA ATA 1104 
Ser Asp Phe Val Lys Glu Pro He Pro Val He Gly- Phe Lys Gly He 
355 360 365 

TTC AAA AGA TTG ATT GAA GGA AAC ACA ACA TTT CTG AAC TGG ACT CCT 1152 
Phe Lys Arg Leu He Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro 
370 375 380 

TAC GGT GGT ATG ATG TCG AAA ATC CCT GAA TCT GCG ATC CCA TTT CCG 12 0 0 
Tyr Gly Gly Met Met Ser Lys He Pro Glu Ser Ala He Pro Phe Pro 
385 390 395 400 

CAT AGA AAC GGA ACC CTC TTC AAG ATT CTC TAT TAC GCG AAC TGG CTA 124 8 
His Arg Asn Gly Thr Leu Phe Lys He Leu Tyr Tyr Ala Asn Trp Leu 
405 410 415 



GAG AAT GAC AAG ACA TCG AGT AGA AAA ATC AAC TGG ATC AAA GAG ATA 
Glu Asn Asp Lys Thr Ser Ser Arg Lys lie Asn Trp lie Lys Glu lie 
420 425 430 



1296 



TAC AAT TAC ATG GCG CCT TAT GTC TCA AGC AAT CCA AGA CAA GCA TAT 1344 
Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gin Ala Tyr 
435 440 445 

GTG AAC TAC AGA GAT CTA GAC TTC GGA CAG AAC AAG AAC AAC GCA AAG 13 92 

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gin Asn Lys Asn Asn Ala Lys 

450 455 460 

GTT AAC TTC ATT GAA GCT AAA ATC TGG GGA CCT AAG TAC TTC AAA GGC 14 4 0 

Val Asn Phe He Glu Ala Lys He Trp Gly Pro Lys Tyr Phe Lys Gly 

465 470 475 480 

AAT TTT GAC AGA TTG GTG AAG ATT AAA ACC AAG GTT GAT CCA GAG AAC 14 8 8 
Asn Phe Asp Arg Leu Val Lys He Lys Thr Lys Val Asp Pro Glu Asn 
485 490 495 

TTC TTC AGG CAC GAG CAG AGT ATC CCA CCT ATG CCC TAC TAG 1530 
Phe Phe Arg His Glu Gin Ser He Pro Pro Met Pro Tyr 
500 505 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser He Gin Asp Gin Phe He Asn Cys Val Lys Arg Asn Thr His Val 
15 10 15 

Ser Phe Pro Leu Glu Lys Thr Leu Phe Thr Pro Ala Lys Asn Val Ser 
20 25 30 

Leu Phe Asn Gin Val Leu Glu Ser Thr Ala Gin Asn Leu Gin Phe Leu 
35 40 45 

Ala Lys Ser Met Pro Lys Pro Gly Phe He Phe Arg Pro He His Gin 
50 55 60 

Ser Gin Val Gin Ala Ser He He Cys Ser Lys Lys Leu Gly He His 
65 70 75 80 

Phe Arg Val Arg Ser Gly Gly His Asp Phe Glu Ala Leu Ser Tyr Val 
85 90 95 



Ser Arg He Glu Lys Pro Phe He Leu Leu Asp Leu Ser Lys Leu Lys 



100 



105 



110 



Gin lie Asn Val Asp lie Glu Ser Asn Ser Ala Trp Val Gin Pro Gly 
115 120 125 

Ala Thr Leu Gly Glu Leu Tyr Tyr Arg lie Ala Glu Lys Ser Lys lie 
130 135 140 

His Gly Phe Pro Ala Gly Leu Cys Thr Ser Val Gly lie Gly Gly Tyr 
145 150 155 160 

Met Thr Gly Gly Gly Tyr Gly Thr Leu Met Arg Lys Tyr Gly Leu Ala 
165 170 175 

Gly Asp Asn Val Leu Asp Val Lys Met Val Asp Ala Asn Gly Lys Leu 
180 185 190 

Leu Asp Arg Ala Ala Met Gly Glu Asp Leu Phe Trp Ala lie Arg Gly 
195 200 205 

Gly Gly Gly Ala Ser Phe Gly lie Val Leu Ala Trp Lys lie Lys Leu 
210 215 220 

Val Pro Val Pro Lys Thr Val Thr Val Phe Thr Val Thr Lys Thr Leu 
225 230 235 240 

Glu Gin Asp Ala Arg Leu Lys Thr lie Ser Lys Trp Gin Gin lie Ser 
245 250 255 

Ser Lys lie lie Glu Glu lie His lie Arg Val Val Leu Arg Ala Ala 
260 265 270 

Gly Asn Asp Gly Asn Lys Thr Val Thr Met Thr Tyr Leu Gly Gin Phe 
275 280 285 

Leu Gly Glu Lys Gly Thr Leu Leu Lys Val Met Glu Lys Ala Phe Pro 
290 295 300 

Glu Leu Gly Leu Thr Gin Lys Asp Cys Thr Glu Met Ser Trp lie Glu 
305 310 315 320 

Ala Ala Leu Phe His Gly Gly Phe Pro Thr Gly Ser Pro He Glu He 
325 330 335 

Leu Leu Gin Leu Lys Ser Pro Leu Gly Lys Asp Tyr Phe Lys Ala Thr 
340 345 350 

Ser Asp Phe Val Lys Glu Pro He Pro Val He Gly Phe Lys Gly He 
355 360 365 

Phe Lys Arg Leu He Glu Gly Asn Thr Thr Phe Leu Asn Trp Thr Pro 
370 375 380 



Tyr Gly Gly Met Met Ser Lys He Pro Glu Ser Ala He 



Pro Phe Pro 



385 



390 



395 



400 



His Arg Asn Gly Thr Leu Phe Lys lie Leu Tyr Tyr Ala Asn Trp Leu 
405 410 415 

Glu Asn Asp Lys Thr Ser Ser Arg Lys lie Asn Trp lie Lys Glu lie 
420 425 430 

Tyr Asn Tyr Met Ala Pro Tyr Val Ser Ser Asn Pro Arg Gin Ala Tyr 
435 440 445 

Val Asn Tyr Arg Asp Leu Asp Phe Gly Gin Asn Lys Asn Asn Ala Lys 
450 455 460 

Val Asn Phe lie Glu Ala Lys lie Trp Gly Pro Lys Tyr Phe Lys Gly 
465 470 475 480 

Asn Phe Asp Arg Leu Val Lys lie Lys Thr Lys Val Asp Pro Glu Asn 
485 490 495 

Phe Phe Arg His Glu Gin Ser lie Pro Pro Met Pro Tyr 
500 505 

(2) INFORMATION FOR SEQ ID NO : 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 9 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



Met Glu Asn Lys Thr Pro lie Phe Phe Ser Leu Ser lie Phe Leu Ser 
15 10 15 



Leu Leu Asn Cys Ala Leu Gly Gly Asn Asp Leu Leu Ser Cys Leu Thr 
20 25 30 



Phe Asn Gly Val Arg Asn His Thr Val Phe Ser Ala Asp Ser Asp Ser 
35 40 45 



Asp Phe Asn Arg Phe Leu His Leu Ser lie Gin Asn Pro Leu Phe Gin 
50 55 60 



Asn Ser Leu lie Ser Lys Pro Ser Ala lie He Leu Pro Gly Ser Lys 
65 70 75 80 



Glu Glu Leu Ser Asn Thr lie Arg Cys lie Arg Lys Gly Ser Trp Thr 
85 90 95 



lie Arg Leu Arg Ser Gly Gly His Ser Tyr Glu Gly Leu Ser Tyr Thr 
100 105 110 



Ser Asp Thr Pro Phe lie Leu lie Asp Leu Met Asn Leu Asn Arg Val 
115 120 125 



Ser He Asp Leu Glu Ser Glu Thr Ala Trp Val Glu Ser Gly Ser Thr 
130 135 140 



Leu Gly Glu Leu Tyr Tyr Ala He Thr Glu Ser Ser Ser Lys Leu Gly 
145 150 155 160 



Phe Thr Ala Gly Trp Cys Pro Thr Val Gly Thr Gly Gly His He Ser 
165 170 175 



Gly Gly Gly Phe Gly Met Met Ser Arg Lys Tyr Gly Leu Ala Ala Asp 
180 185 190 



Asn Val Val Asp Ala lie Leu He Asp Ala Asn Gly Ala He Leu Asp 
195 200 205 



Arg Gin Ala Met Gly Glu Asp Val Phe Trp Ala He Arg Gly Gly Gly 
210 215 220 



Gly Gly Val Trp Gly Ala He Tyr Ala Trp Lys He Lys Leu Leu Pro 
225 230 235 240 



Val Pro Glu Lys Val Thr Val Phe Arg Val Thr Lys Asn Val Ala He 
245 250 255 



Asp Glu Ala Thr Ser Leu Leu His Lys Trp Gin Phe Val Ala Glu Glu 
260 265 270 



Leu Glu Glu Asp Phe Thr Leu Ser Val Leu Gly Gly Ala Asp Glu Lys 
275 280 285 



Gin Val Trp Leu Thr Met Leu Gly Phe His Phe Gly Leu Lys Thr Val 
290 295 300 



Ala Lys Ser Thr Phe Asp Leu Leu Phe Pro Glu Leu Gly Leu Val Glu 
305 310 315 320 



Glu Asp Tyr Leu Glu Met Ser Trp Gly Glu Ser Phe Ala Tyr Leu Ala 
325 330 335 



Gly Leu Glu Thr Val Ser Gin Leu Asn Asn Arg Phe Leu Lys Phe Asp 
340 345 350 



Glu Arg Ala Phe Lys Thr Lys Val Asp Leu Thr Lys Glu Pro Leu Pro 
355 360 365 



Ser Lys Ala Phe Tyr Gly Gly Leu Leu Glu Arg Leu Ser Lys Glu Pro 
370 375 380 



Asn Gly Phe lie Ala Leu Asn Gly Phe Gly Gly Gin Met Ser Lys lie 
385 390 395 400 



Ser Ser Asp Phe Thr Pro Phe Pro His Arg Ser Gly Thr Arg Leu Met 
405 410 415 



Val Glu Tyr lie Val Ala Trp Asn Gin Ser Glu Gin Lys Lys Lys Thr 
420 425 430 



Glu Phe Leu Asp Trp Leu Glu Lys Val Tyr Glu Phe Met Lys Pro Phe 
435 440 445 



Val Ser Lys Asn Pro Arg Leu Gly Tyr Val Asn His lie Asp Leu Asp 
450 455 460 



Leu Gly Gly lie Asp Trp Gly Asn Lys Thr Val Val Asn Asn Ala lie 
465 470 475 480 



Glu lie Ser Arg Ser Trp Gly Glu Ser Tyr Phe Leu Ser Asn Tyr Glu 
485 490 495 



Arg Leu lie Arg Ala Lys Thr Leu lie Asp Pro Asn Asn Val Phe Asn 
500 505 510 



His Pro Gin Ser lie Pro Pro Met Ala Asn Phe Asp Tyr Leu Glu Lys 



515 



520 



525 



Thr Leu Gly Ser Asp Gly Gly Glu Val Val lie 
530 535 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Met Met Cys Arg Ser Leu Thr Leu Arg Phe Phe Leu Phe lie Val Leu 
15 10 15 



Leu Gin Thr Cys Val Arg Gly Gly Asp Val Asn Asp Asn Leu Leu Ser 
20 25 30 



Ser Cys Leu Asn Ser His Gly Val His Asn Phe Thr Thr Leu Ser Thr 
35 40 45 



Asp Thr Asn Ser Asp Tyr Phe Lys Leu Leu His Ala Ser Met Gin Asn 
50 55 60 



Pro Leu Phe Ala Lys Pro Thr Val Ser Lys Pro Ser Phe lie Val Met 
65 70 75 80 



Pro Gly Ser Lys Glu Glu Leu Ser Ser Thr Val His Cys Cys Thr Arg 
85 90 95 



Glu Ser Trp Thr lie Arg Leu Arg Ser Gly Gly His Ser Tyr Glu Gly 
100 105 110 



Leu Ser Tyr Thr Ala Asp Thr Pro Phe Val He Val Asp Met Met Asn 
115 120 125 



Leu Asn Arg He Ser He Asp Val Leu Ser Glu Thr Ala Trp Val Glu 



130 135 140 



Ser Gly Ala Thr Leu Gly Glu Leu Tyr Tyr Ala lie Ala Gin Ser Thr 
145 150 155 160 



Asp Thr Leu Gly Phe Thr Ala Gly Trp Cys Pro Thr Val Gly Ser Gly 
165 170 175 



Gly His lie Ser Gly Gly Gly Phe Gly Met Met Ser Arg Lys Tyr Gly 
180 185 190 



Leu Ala Ala Asp Asn Val Val Asp Ala lie Leu lie Asp Ser Asn Gly 
195 200 205 



Ala lie Leu Asp Arg Glu Lys Met Gly Asp Asp Val Phe Trp Ala lie 
210 215 220 



Arg Gly Gly Gly Gly Gly Val Trp Gly Ala lie Tyr Ala Trp Lys lie 
225 230 235 240 



Lys Leu Leu Pro Val Pro Glu Lys Leu Thr Val Phe Arg Val Thr Lys 
245 250 255 



Asn Val Gly lie Glu Asp Ala Ser Ser Leu Leu His Lys Trp Gin Tyr 
260 265 270 



Val Ala Asp Glu Leu Asp Glu Asp Phe Thr Val Ser Val Leu Gly Gly 
275 280 285 



Val Asn Gly Asn Asp Ala Trp Leu Met Phe Leu Gly Leu His Leu Gly 
■ 290 295 300 



Arg Lys Asp Ala Ala Lys Thr lie lie Asp Glu Lys Phe Pro Glu Leu 
305 310 315 320 



Gly Leu Val Asp Lys Glu Phe Gin Glu Met Ser Trp Gly Glu Ser Met 
325 330 335 



Ala Phe Leu Ser Gly Leu Asp Thr lie Ser Glu Leu Asn Asn Arg Phe 
340 345 350 



Leu Lys Phe Asp Glu Arg Ala Phe Lys Thr Lys Val Asp Phe Thr Lys 
355 360 365 



Val Ser Val Pro Leu 
370 



Ser Glu Gin Pro Gly 
385 



Met Ser Glu lie Ser 
405 



Thr Lys Leu Met Phe 
420 



Ser Lys lie Gly Glu 
435 



Leu Glu Pro Phe Val 
450 



lie Asp Leu Asp lie 
465 

Thr Asn Ala Val Glu 
485 



Ser Asn Tyr Glu Arg 
500 



Asn Val Phe Asn His 
515 



Glu lie Tyr Met Leu 
530 



Asn Val Phe Arg His His 
375 



Gly Phe lie Ala Leu Asn 
390 395 



Thr Asp Phe Thr Pro Phe 
410 



Glu Tyr lie lie Ala Trp 
425 



Phe Ser Glu Trp Leu Ala 
440 



Ser Lys Glu Pro Arg Val 
455 



Gly Gly lie Asp Trp Arg 
470 475 

lie Ala Arg Asn Trp Gly 
490 



Leu Val Lys Ala Lys Thr 
505 



Pro Gin Ser lie Pro Pro 
520 



Lys Glu Leu 
535 



Ala Leu Glu Met Leu 
380 



Gly Phe Gly Gly Lys 
400 



Pro His Arg Lys Gly 
415 



Asn Gin Asp Glu Glu 
430 



Lys Phe Tyr Asp Tyr 
445 



Gly Tyr Val Asn His 
460 



Asn Lys Ser Ser Thr 
480 

Glu Arg Tyr Phe Ser 
495 



Leu lie Asp Pro Asn 
510 



Met Met Lys Phe Glu 
525 



